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Cloned DNA Polymerases from Thermotoga and Mutants 
Thereof 



Background of the Invention 



Field of the Invention 

The present invention relates to a substantially pure thermostable DNA 
polymerase. Specifically, the DNA polymerase of the present invention is a 
Thermotoga DNA polymerase and more specifically a Thermotoga neapolitana 
(Tne) DNA polymerase or Thermotoga maritima (Tma) DNA polymerase. 

... . -u„„„^,«i^,,i«r W pipht of about 100 kilodaltons. The 

Preferably, me polymerase no, « w^.— 

present invention also relates to the cloning and expression of the Thermotoga 
DNA polymerase in E. colU to DNA molecules containing the cloned gene, and 
to hosts which express said genes. The DNA polymerase of the present invention 
may be used in DNA sequencing, amplification reactions, and cDNA synthesis. 

This invention also relates to mutants of the Thermotoga DNA 
polymerase, including Tne and Tma DNA polymerase. Specifically, the DNA 
polymerases of the present invention have mutations which substantially reduce 
3'-5' exonuclease activity; mutations resulting in the ability of the mutant DNA 
polymerase to incorporate dideoxynucleotides into a DNA molecule about as 
efficiently as deoxynucleotides: and mutations which substantially reduce 5-3' 
exonuclease activity. The Thermotoga (e.g., Tne and Tma) mutant DNA 
polymerase of this invention can have one or more of these properties. These 
DNA polymerase mutants may also be used in DNA sequencing, amplification 
reactions, and cDNA synthesis. 

The present invention is also directed to novel mutants of other DNA 
polymerases which have substantially reduced 5'-3' exonuclease activity. 
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Background Information 

DNA polymerases synthesize the formation of DNA molecules which are 
complementary to a DNA template. Upon hybridization of a primer to the single- 
stranded DNA template, polymerases synthesize DNA in the 5' to 3' direction, 
successively adding nucleotides to the 3'-hydroxyl group of the growing strand. 
Thus, in the presence of deoxyribonucleoside triphosphates (dNTPs) and a 
primer, a new DNA molecule, complementary to the single stranded DNA 

template, can be synthesized. 

r m, a . n i„ mM , c « have been isolated from mesophilic 
A numoer ui u^r^ pui;i..>..u... — - 

mtcroorganisms such as £ coll A number of these mesophilic DNA 
polymerases have also been cloned. Lin « ai cloned and expressed T4 DNA 
polymerase in* coli^ac. N*L Acad ScL USA «:7000-7004 (1987)). Tabor 
e, ai (U.S. Patent No. 4,795.699) describes a cloned T7 DNA polymerase, wh.le 
Minkley a ai (J. Bid Chem. 259:10386-10392 (1984)) and Chatterjee (U.S. 
Patent No. 5,047.342) described £. coli DNA polymerase I and the cloning of T5 
DNA polymerase, respectively. 

Although DNA polvmerases from thermophiles are known, relatively 
little investigation has been done to isolate and even clone these enzymes. Chien 
et al, J- Bacterial 727:1550-1557 (1976) describe a purification scheme for 
obtaining a polymerase from Thermos aquaticus (Taq). The resulting protein had 
a molecular weight of about 63,000 daltons by gel filtration analysis and 68,000 
daltons by sucrose gradient centnfugation. Kaledin et al, Biokhymiya 45:644-51 
(1980) disclosed a purification procedure for isolating DNA polymerase from 
T aquaticus YT1 strain. TTie purified enzyme was reported to be a 62,000 dalton 
monomeric protem. Gelfand et al (U.S. Patent No. 4,889,818) cloned a gene 
encoding a thermostable DNA polymerase from Thermus aquaticus. The 



polymerase from a fhermotoga spenes v - 
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(1990)). The purified DNA polymerase isolated by Simpson et al. exhibited a 
molecular weight of 85,000 daltons as determined by SDS-polyacrylamide gel 
electrophoresis and size-exclusion chromatography. The enzyme exhibited half- 
lives of 3 minutes at 95°C and 60 minutes at 50°C in the absence of substrate and 
its pH optimum was in the range of pH 7.5 to 8.0. Triton X-100 appeared to 
enhance the thermostability of this enzyme. The strain used to obtain the 
thermostable DNA polymerase described by Simpson et al was Thermotoga 
species strain FJSS3-B.1 (Hussar et al, FEMS Microbiology Letters 37:121-127 
(1986)). Others have cloned and sequenced a thermostable DNA polymerase 
^, . _ .„• fli? Patent 5 374 553. which is expressly 

irom intrrnOtu^u mu/ ~ v -■ 

incorporated herein by reference). 

Other DNA polymerases have been isolated from thermophilic bacteria 
including Bacillus steraothermophilus (Stenesh et al., Biochim. Biophys. Acta 
272:156-166 (1972); and Kaboev et al. J. Bacterial 145:21-26 (1981)) and 
several archaebacterial species (Rossi et al, System. Appl Microbiol. 7:337-341 
(1986); Klimczak et al. Biochemistry 25:4850-4855 (1986): and Elie et al, Eur. 
J. Biochem. 775:619-626(1989)). The most extensively purified archaebacterial 
DNA polymerase had a reported half-life of 15 minutes at 87 °C (Elie et al 
(1989), supra). Innis et al, In PCR Protocol: A Guide To Methods and 
Amplification. Academic Press, Inc., San Diego (1990) noted that there are 
several extreme thermophilic eubacteria and archaebacteria that are capable of 
growth at very high temperatures (Bergquist et al, Biotech. Genet. Eng. Rev. 
5:199-244 (1987); and Kelly et al, Biotechnol. Prog. 4A1-62 (1988)) and 
suggested that these organisms may contain very thermostable DNA polymerases. 

In many of the known polymerases, the 5'-3' exonuclease activity is 
present in the N-terminal region of the polymerase. (Ollis, et al, Nature 
375:762-766 (.1985); Freemont et al, Proteins 7:66-73 (1986); Joyce, Cur. Opin. 



polymerase l. (.uuunaii c 
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amino adds include Tyr", Gly'°\ Gly»\ and Gly'« in E. coli DNA polymerase 
I It is known that the 5 '-exonuclease domain is dispensable. The best known 
example is the Klenow fragment of £. coli polymerase I. The Klenow fragment 
is a natural proteolytic fragment devoid of 5 '-exonuclease activity (Joyce et. ai, 
J.BioLChem. 257:1958-64(1990).) Polymerases lacking this activity are useful 

for DNA sequencing. 

Most DNA polymerases also contain a 3'-5' exonuclease activity. This 
exonuclease activity provides a proofreading ability to the DNA polymerase. A 
T5 DNA polymerase that lacks 3'-5' exonuclease activity is disclosed in U.S. 

. - . *v,;c crtivitv are Darticularly useful 
Patent No. 5,270.1 /V. roiymerabc* i«<m.. b — - • 

for DNA sequencing. 

The polymerase active site, including the dNTP binding domain is usually 
present at the carboxyl terminal re gl on of the polymerase (Ollis et al., Nature 
373-762-766 (1985); Freemont et al., Proteins 7:66-73 (1986)). It has been 
shown that Phe- of £ coli polymerase I is one of the amino acids that directly 
interacts with the nucleotides (Joyce & Steitz, Ann. Rev. Biochem. 63:777-822 
(1994); Astatke, J. Biol. Chem. 270:1945-54(1995)). Converting this amino acid 
to a Tyr results in a mutant DNA polymerase that does not discriminate against 
dideoxynucleotides. See copending U.S. Application No. 08/525,087, of Deb K. 
Chatterjee. filed September 8, 1995, entitled "Mutant DNA Polymerases and the 
Use Thereof," which is expressly incorporated herein by reference. 

Thus, there exists a need in the art to develop more thermostable DNA 
polymerases. There also exists a need in the art to obtain wild type or mutant 
DNA polymerases that are devoid of exonuclease activities and are non- 
discriminating against dideoxynucleotides. 



PCT/US96/14189 



Summary of the Invention 

Bc present invention satisfies these needs in the art by providing 
additional DNA polymerases useful in molecuiar biology. Specified, dns 
invention includes a thermostable DNA polymerase. Preferably, the polymerase 
has a molecular wetgh. of about .00 kilodal.on, Specifca.ly, the DNA 
polymerase of invention is isolated from nermcoga, and more specially, 
me DNA polymerase ,s obtained from ThermCoga neapoli.am (Tne) and 
Thermos marUl.a (TfcA The Tkerno.oga species preferred for isolaung me 
DNA polymerase of the present invention was isolated fro.,, an African 
continental solfataric spring (Windberger e, a,.. Arch. Microbiol. ,51. 506-512, 
(1989)). 

The Thermos DNA polymerases of the present invention are extremely 
thermostable, showing more than 50% of acuvity after be.ng heated for 60 
minutes at 90°C wtth or without detergent. Thus, the DNA polymerases of the 
present invention is more thermostable than Tag DNA polymerase. 

The present invention is also directed to cloning a gene encodmg a 
THer m o,o g a DNA polymerase enzyme. DNA molecules contatning the 
7fcn-K»*. DNA polymerase genes, accordmg to the present tnvention, can be 
transformed and expressed in a host cell to produce the DNA polymerase. Any 
number of hosts may be used to express the THer m o,o g a DNA poiymerase gene 
of the present invention, including prokaryotie and eukaryotic cells. Preferab.y, 
proktuyotic cells are used to express the DNA polymerase of the invention. The 
preferred prokaryotie host according to the present invention ts E. coh. 

The present invenuon also relates — thermostable DNA polymerases 
of the Poll type artd DNA coding therefor, wherein mere is amino acid change ,n 
,h, O-helix which renders the polymerase nondiscriminatory against ddNTPs m 



wherein X is any ammo acui 
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The present invention also relates to Thermotoga DNA polymerase 
mutants that lack exonuclease activity and/or which are nondiscriminatory against 
ddNTPs in sequencing reactions. 

The present invention is also directed generally to DNA polymerases that 
have mutations that result in substantially reduced or missing 5'-3' exonuclease 
activity. 

In particular, the invention relates to a Thermotoga DNA polymerase 
mutant which is modified at least one way selected from the group consisting of 

(a) to reduce or eliminate the 3 '-5" exonuclease activity of the 
polymerase; 

(b) to reduce or eliminate the 5 '-3' exonuclease activity of the 
polymerase; and 

(c) to reduce or eliminate discriminatory behavior against a 
dideoxvnucleotide. 

The invention also relates to a method of producing a DNA polymerase, 

said method comprising: 

(a) culturing the host cell of the invention; 

(b) expressing said gene; and 

(c) isolating said DNA polymerase from said host cell. 

The invention also relates to a method of synthesizing a double-stranded 
DNA molecule comprising: 

(a) hybridizing a primer to a first DNA molecule; and 

(b) incubating said DNA molecule of step (a) in the presence of one 
or more deoxy- or dideoxynbonucleoside triphosphates and the DNA polymerase 
of the invention, under conditions sufficient to synthesize a second DNA 
molecule complementary to all or a portion of said first DNA molecule. Such 
deoxy- and dideoxynbonucleoside triphosphates include dATP, dCTP, dGTP, 
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The invents also relates to a method of sequencing a DNA molecule, 
comprising: 

(a) hybridizing a primer to a first DNA molecule; 

(b) contacting said DNA molecule of step (a) with 
deoxyribonucleoside triphosphates, the DNA polymerase of the invention, and 
a terminator nucleotide; 

(c) incubating the mixture of step (b) under conditions sufficient to 
synthesize a random population of DNA molecules complementary to said first 
DNA molecule, wherein said synthesized DNA molecules are shorter in length 

_ * j .u^-joA TTNIA molecules 

than said first DNA molecule and wnerein sam - - ~- 

comprise a terminator nucleotide at their 3' termini; and 

(d) separating said synthesized DNA molecules by size so that at least a 
part of the nucleotide sequence of said first DNA molecule can be determined. 
Such terminator nucleotides include ddTTP, ddATP, ddGTP, ddlTP or ddCTP. 

The invention also relates to a method for amplifying a double stranded 
DNA molecule, comprising: 

(a) providing a first and second primer, wherein said first primer is 
complementary to a sequence at or near the 3 -termim of the first strand of said 
DNA molecule and said second primer is complementary to a sequence at or near 
the 3'-termini of the second strand of said DNA molecule; 

(b) hybridizing said first primer to said first strand and said second 
primer to said second strand in the presence of the DNA polymerase of the 
invention, under conditions such that a third DNA molecule complementary to 
said first strand and a fourth DNA molecule complementary to said second strand 
are synthesized; 

(c) denaturing said first and third strand, and said second and fourth 
strands: and 
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The invention also relates to a kit for sequencing a DNA molecule, 
comprising: 

(a) a first container means comprising the DNA polymerase of the 
invention; 

(b) a second container means comprising one or more 
dideoxyribonucleoside triphosphates; and 

(c) a third container means comprising one or more 
deoxyribonucleoside triphosphates. 

The invention also relates to a kit for amplifying a DNA molecule, 

comprising: 

(a) a first container means comprising the DNA polymerase of the 
invention; and 

(b) a second container means comprising one or more 
deoxyribonucleoside triphosphates. 

The present invention also relates to a mutant DNA polymerase having 
substantially reduced or eliminated 5'-3' exonuclease activity, wherein at least 
one of the amino acids corresponding to Asp 8 , Glu'», Asp 114 , Asp" 5 , Asp 1 ", 
Asp' 59 . Gly 102 , Gly 187 , or Gly 195 of Tne DNA polymerase has been mutated. 

The present invention also relates to a method of producing a mutant 
DNA polymerase having substantially reduced or eliminated 5'-3' exonuclease 
activity, wherein at least one of the amino acids corresponding to Asp 8 , Glu 112 , 
Asp" 4 , Asp 115 , Asp 137 , Asp 139 , Gly 102 , Gly 187 , or Gly 195 of Tne DNA polymerase 
has been mutated, comprising: 

(a) culturing the host cell of the invention; 

(b) expressing the mutant DNA polymerase; and 

(c) isolating said mutant DNA polymerase. 
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Brief Description of the Figures 

FIG. 1 demonstrates the heat stability of Tne DNA polymerase at 90 °C 
over time. Partially purified DNA polymerase from the crude extract of 
Thermotoga neapolitana cells was used in the assay. 

FIG. 2 shows the time-dependent DNA polymerase activity of Tne DNA 
polymerase isolated from an E. coli host containing the cloned Tne DNA 
polymerase gene. 

FIG. 3 compares the ability of various DNA polymerases to incorporate 
radioactive dATP and [aS]dATP. Tne DNA polymerase is more effective at 
incorporating [aS]dATP than was Taq DNA polymerase. 

FIG. 4 shows the restriction map of the approximate DNA fragment which 
contains the Tne DNA polymerase gene in pSport 1 and pUC19. This figure also 
shows the region containing the O-helix homologous sequences. 

FIGS. 5 A and 5B shows the nucleotide and deduced amino acid 
sequences, in all 3 reading frames, for the carboxyl terminal portion, including 
the O-helix region, of the Thermotoga neapolitana polymerase gene. 

FIG. 6A schematically depicts the construction of plasmids pUC-Tne 
(3'-5') and pUC-Tne FY. 

FIG. 6B schematically depicts the construction of plasmids pTrc Tne35 

and pTrcTne FY. 

FIG. 7 schematically depicts the construction of plasmid pTrcTne35 FY. 

FIG. 8 schematically depicts the construction of plasmid pTTQTne5 FY 
and pTTQTne535FY. 

FIG. 9 depicts a gel containing two sequencing reaction sets showing the 
efficient 35 S incorporation by Tne DNA polymerase of Example 12. Alkali- 
denatured pUC19 DNA was sequenced with Tne DNA polymerase in set A. M13 



HG. 1 1) depicts a uci coniunune unci, mmuuiuumu,^ ... 
that the mutant Tne DNA polymerase of Example 12 generates clear sequence 



WO 97/09451 



PCT/US96/14189 



from plasmids containing cDNAs with poly(dA) tails. Alkali-denatured plasmid 
DNAs containing cDNA inserts were sequenced using either Tne DNA 
polymerase (sets A and B). or Sequenase Ver 2.0 (set C). 

FIG. 1 1 depicts a gel containing three sequencing reaction sets that 
compare the mutant Tne DNA polymerase of Example 12 (set A), Sequenase™ 
(set B) and Tag DNA polymerase (set C) generated sequences from a plasmid 
containing poly(dC). 

FIG. 12 depicts a gel containing three sequencing reaction sets showing 
that the mutant Tne DNA polymerase of Example 12 (set A) produces 35 S-labeled 

— , o TM lr. a * m <jnrl without the 

sequence 3-fold stronger man inermo sequent w - ~, — 

uneven band intensities obtained with Tag DNA polymerase (set C). 

FIG. 13 depicts a gel containing four sequencing reaction sets 
demonstrating that the mutant Tne DNA polymerase of Example 12 produces 
high quality sequences of in vitro amplified DNA ( set A, E. coli P poll (~450bp); 
set B, E. coli rrsE (-350 bp); set C, ori from pSClOl (~1 .5 kb); and set D, an 
exon from human HSINF gene (-750 bp). 

FIGS. 14A and 14B depict gels containing three and four sequencing 
reaction sets, respectively, showing that the mutant Tne DNA polymerase of 
Example 12 provides superior sequence from double-stranded DNA clones 
containing poly(dA) or poly(dC) stretches. Fig. HA, supercoiled plasmid DNAs 
containing inserts with homopolymers were cycle sequenced using the mutant 
Tne DNA polymerase (set A. RPA1; set B, elf (cap binding protein); and set C, 
apoly(dC)-tailed 5' RACE-derived insert). Fig. 14B, supercoiled plasmid DNAs 
containing inserts with homopolymers were cycled sequenced using Tag DNA 
polymerase (set D), or SequiTherm™ (sets E-G) (set D, RPA; set E, RPA; set F, 
a P oly(dC)-tailed 5' RACE-derived insert; and set G, elf). 

FIG. 15 - depicts a gel containing two sequencing reaction sets showing 



LTia-iuDeieu prune; 
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FIGS. 16A-16C and 16D-16F depict two sets of chromatograms showing 
comparison of the mutant Tne DNA polymerase of Example 12 (16A-16C) to 
AmpliTaq FS™ (16D-16F) in Fluorescent Dye Primer Sequencing. 

FIGS. 17A-17C and 17D-17F depict chromatograms showing a 
comparison of the mutant Tne DNA polymerase of Example 12 (17A) to 
AmpliTaq FS™ (17B) in Fluorescent Dye Terminator Sequencing. 

Detailed Description of the Preferred Embodiments 



Definitions 

In the description that follows, a number of terms used in recombinant 
DNA technology are extensively utilized. In order to provide a clearer and 
consistent understanding of the specification and claims, including the scope to 
be given such terms, the following definitions are provided. 

Cloning vector. A plasmid, cosmid or phage DNA or other DNA 
molecule which is able to replicate autonomously in a host cell, and which is 
characterized bv one or a small number of restriction endonuclcase recognition 
sites a, which such DNA sequences may be cut in a determinable fashion without 
loss of an essential biological function of the vector, and into which DNA may 
be spliced in order to bring about its replication and cloning. The cloning vector 
may further contain a marker suitable for use in the identification of cells 
transformed with the cloning vector. Markers, for example, are tetracychne 
resistance or ampicillin resistance. 

Expression vector. A vector similar to a cloning vector but which is 
capable of enhancing the expression of a gene which has been cloned into it, after 
.^fnrmntiofi into a host The cloned gene is usually placed under the control 



Recombinant host. Anv prokaryoiK «>i cu..h w - 

which contains the desired cloned genes in an expression vector, cloning vector 
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or any DNA molecule. The term "recombinant host" is also meant to include 
those host cells which have been genetically engineered to contain the desired 
gene on the host chromosome or genome. 

Host. Any prokaryotic or eukaryotic microorganism that is the recipient 
of a replicable expression vector, cloning vector or any DNA molecule. The 
DNA molecule may contain, but is not limited to, a structural gene, a promoter 
and/or an origin of replication. 

Promoter. A DNA sequence generally described as the 5' region of a 
gene, located proximal to the start codon. At the promoter region, transcription 
of an adjacent gene(s) is initiated. 

Gene. A DNA sequence that contains information necessary for 
expression of a polypeptide or protein. It includes the promoter and the structural 
gene as well as other sequences involved in expression of the protein. 

Structural gene. A DNA sequence that is transcribed into messenger 
RNA that is then translated into a sequence of amino acids characteristic of a 
specific polypeptide. 

Operably linked. As used herein means that the promoter is positioned 
to control the initiation of expression of the polypeptide encoded by the structural 
gene. 

Expression. Expression is the process by which a gene produces a 
polypeptide. It includes transcription of the gene into messenger RNA (mRNA) 
and the translation of such mRNA into polypeptide(s). 

Substantially Pure. As used herein "substantially pure" means that the 
desired purified protein is essentially free from contaminating cellular 
contaminants which are associated with the desired protein in nature. 
Contaminating cellular components may include, but are not limited to, 
phosphatases, exonucleases. endonucleases or undesirable DNA polymerase 
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Primer. As used herein "primer" refers to a single-stranded 
oligonucleotide that is extended by covalent bonding of nucleotide monomers 
during amplification or polymerization of a DNA molecule. 

Template. The term "template" as used herein refers to a double- stranded 
or single-stranded DNA molecule which is to be amplified, synthesized or 
sequenced. In the case of a double-stranded DNA molecule, denaturation of its 
strands to form a first and a second strand is performed before these molecules 
may be amplified, synthesized or sequenced. A primer, complementary to a 
portion of a DNA template is hybridized under appropriate conditions and the 
DNA polymerase of the invention may then synthesize a DNA molecule 
complementary to said template or a portion thereof. The newly synthesized 
DNA molecule, according to the invention, may be equal or shorter in length than 
the original DNA template. Mismatch incorporation during the synthesis or 
extension of the newly synthesized DNA molecule may result in one or a number 
of mismatched base pairs. Thus, the synthesized DNA molecule need not be 
exactly complementary to the DNA template. 

Incorporating. The term "incorporating" as used herein means becoming 
a part of a DNA molecule or primer. 

Amplification. As used herein "amplification" refers to any in vitro 
method for increasing the number of copies of a nucleotide sequence with the use 
of a DNA polymerase. Nucleic acid amplification results in the incorporation of 
nucleotides into a DNA molecule or primer thereby forming a new DNA 
molecule complementary to a DNA template. The formed DNA molecule and 
its template can be used as templates to synthesize additional DNA molecules. 
As used herein, one amplification reaction may consist of many rounds of DNA 
replication. DNA amplification reactions include, for example, polymerase chain 
reactions (PCR). One PCR reaction may consist of 30 to 100 "cycles" of 



Oligonucleotide. > Higonucieotiue :cici.- • .uu*.«* 
molecule comprising a covalentlv linked sequence ot nucleotides which , 
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joined by a phosphodiester bond between the 3' position of the pentose of one 
nucleotide and the 5' position of the pentose of the adjacent nucleotide. 

Nucleotide. As used herein "nucleotide" refers to a base-sugar-phosphate 
combination. Nucleotides are monomeric units of a nucleic acid sequence (DNA 
and RNA). The term nucleotide includes deoxyribonucleoside triphosphates such 
as dATP, dCTP, dITP, dUTP, dGTP, dTTP, or derivatives thereof. Such 
derivatives include, for example, [aS]dATP, 7-deaza-dGTP and 7-deaza-dATP. 
The term nucleotide as used herein also refers to dideoxyribonucleoside 
triphosphates (ddNTPs) and their derivatives. Illustrated examples of 

. . . , _ t« A TP 

dideoxyribonucleoside triphosphates include, dui aic uui — — 

ddCTP. ddGTP. ddlTP. and ddTTP. According to the present invention, a 
"nucleotide" may be unlabeled or detectably labeled by well known techniques. 
Detectable labels include, for example, radioactive isotopes, fluorescent labels, 
chemiluminescent labels, bioluminescent labels and enzyme labels. 

Thermostable. As used herein "thermostable" refers to a DNA 
polymerase which is resistant to inactivation by heat. DNA polymerases 
synthesize the formation of a DNA molecule complementary to a single-stranded 
DNA template by extending a primer in the 5'-to-3' direction. This activity for 
mesophilic DNA polymerases may be inactivated by heat treatment. For 
example. T5 DNA polymerase activity is totally inactivated by exposing the 
enzyme to a temperature of 90°C for 30 seconds. As used herein, a thermostable 
DNA polymerase activity is more resistant to heat inactivation than a mesophilic 
DNA polymerase. However, a thermostable DNA polymerase does not mean to 
refer to an enzyme which is totally resistant to heat inactivation and thus heat 
treatment may reduce the DNA polymerase activity to some extent. A 
thermostable DNA polymerase typically will also have a higher optimum 
temperature than mesophilic DNA polymerases. 



and/or DNA) to give a double-stranded molecule. As used herein, iwo 
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acid molecules may be hybridized, although the base pairing is not completely 
complementary. Accordingly, mismatched bases do not prevent hybridization of 
two nucleic acid molecules provided that appropriate conditions, well known in 
the art, are used. 

3'-to-5' Exonuclease Activity. "3'-to-5' exonuclease activity" is an 
enzymatic activity well known to the art. This activity is often associated with 
DNA polymerases, and is thought to be involved in a DNA replication "editing" 
or correction mechanism. 

A "DNA polymerase substantially reduced in 3'-to-5' exonuclease 

.is rvM a nAivmprave that has about 

activity" is delined herein as euner u ; a ^ - - v ~ - 

or less than 10%. or preferably about or less than 1%. of the 3'-to-5' exonuclease 
activity of the corresponding unmutated. wild-type enzyme, or (2) a DNA 
polymerase having a 3'-to-5' exonuclease specific activity which is less than 
about 1 unit/mg protein, or preferably about or less than 0.1 units/mg protein. A 
unit of activity of 3'-to-5' exonuclease is defined as the amount of activity that 
solubilizes 10 nmoles of substrate ends in 60 min. at 37°C, assayed as described 
in the "BRL 1989 Catalogue & Reference Guide", page 5, with Html fragments 
of lambda DNA 3 '-end labeled with pHJdTTP by terminal deoxynucleotidyl 
transferase (TdT). Protein is measured by the method of Bradford, Anal. 
Biochem. 72:248 (1976). As a means of comparison, natural, wild-type T5-DNA 
polymerase (DNAP) or T5-DNAP encoded by pTTQ19-T5-2 has a specific 
activity of about 10 units/mg protein while the DNA polymerase encoded by 
pTTQ19-T5-2(Exo) (U.S. 5,270,179) has a specific activity of about 
0.0001 units/mg protein, or 0.001% of the specific activity of the unmodified 
enzyme, a 10 5 -fold reduction. 

5'-to-3' Exonuclease Activity. "5'-to-3' exonuclease activity" is also an 
enzymatic activity well known in the art. This activity is often associated with 
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or less than 10%. or preferably about or less than 1%. of the 5'-to-3' exonuclease 
activity of the corresponding unmutated. wild-type enzyme, or (2) a DNA 
polymerase having 5'-to-3' exonuclease specific activity which is less than about 
1 unit mg protein, or preferably about or less than 0.1 units/mg protein. 

Both of the 3'-to-5' and 5'-to-3' exonuclease activities can be observed 
on sequencing gels. Active 5'-to-3' exonuclease activity will produce nonspecific 
ladders in a sequencing gel by removing nucleotides from the 5'-end of the 
growing primers. 3'-to-5' exonuclease activity can be measured by following the 
degradation of radiolabeled primers in a sequencing gel. Thus, the relative 
. . . • :i j «, mntont nnlvmerases. 

amounts of these activities, e.g. oy comparing wnu-ivpw — - 
can be determined with no more than routine experimentation. 



Cloning and Expression of Thermotoga DNA Polymerases 



The Thermotoga DNA polymerase of the invention can be isolated from 
any strain of Thermotoga which produces a DNA polymerase. The preferred 
strain to isolate the gene encoding Thermotoga DNA polymerase of the present 
invention is Thermotoga neapolitana (Tne) and Thermotoga mantuna (Tma). 
The most preferred Thermotoga neapolitana for isolating the DNA polymerase 
of the invention was isolated from an African continental solfataric spring 
(Windberger et aL Areh. Microbiol 757:506-512 (1989) and may be obtained 
from Deutsche Sammlung von Microorgamsmen und Zellkulturen GmbH 
(DSMZ; German Collection of Microorganisms and Cell Culture) Mascheroder 
Weg lb D-38124 Braunschweig. Germany, as Deposit No. 5068 (deposited 
December 13. 1988). 

To clone a gene encoding a Thermotoga DNA polymerase of the 
invention, isolated DNA which contains the polymerase gene obtained from 
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must be compatible with the host in which the recombinant DNA library will be 
transformed. 

Prokaryotic vectors for constructing the plasmid library include plasmids 
such as those capable of replication in E. coli such as, for example, pBR322, 
ColEl, pSClOl, pUC-vectors (pUC18, pUC19, etc.: In: Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
New York (1982); and Sambrook et al, In: Molecular Cloning A Laboratory 
Manual (2d ed.) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York (1989)). Bacillus plasmids include pC 194, pC221, P C2 17. etc. Such 
plasmids are disclosed by Giyczan, T. In- The Molecular Biology Bacilli, 
Academic Press, York (1982), 307-329. Suitable Streptomyces plasmids include 
pIJlOl (Kendall et al, J. Bacterial 169:4177-4183 (1987)). Pseudomonas 
plasmids are reviewed by John et al, (Rod. Insec. Dis. 5:693-704 (1986)), and 
Igaki, (Jpn. J. Bacterial 35:729-742 (1978)). Broad-host range plasmids or 
cosmids, such as pCP13 (Darzins and Chakrabarbary, J. Bacterial. 759:9-18, 
1984) can also be used for the present invention. The preferred vectors for 
cloning the genes of the present invention are prokaryotic vectors. Preferably, 
pCP13 and pUC vectors are used to clone the genes of the present invention. 

The preferred host for cloning the wild type or mutant DNA polymerase 
genes of the invention is a prokaryotic host. The most preferred prokaryotic host 
is E coli. However, the wild type or mutant DNA polymerase genes of the 
present invention may be cloned in other prokaryotic hosts including, but not 
limited to, Escherichia, Bacillus, Streptomyces, Pseudomonas, Salmonella, 
Serratia, and Proteus. Bacterial hosts of particular interest include E. coli 
DH10B, which may be obtained from Life Technologies, Inc. (LTI) 
(Gaithersburg, MD). 

Eukaryotic hosts for cloning and expression of the wild type or mutant 

.ens. repression oi mc uesirca w.n.-\ ( Husmc;.i..;. ' 

require the use of eukaryotic regulatory regions which include eukaryotic 
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promoters. Cloning and expressing the wild type or mutant DN A polymerase 
gene of the invention in eukaryotic cells may be accomplished by well known 
techniques using well known eukaryotic vector systems. 

Once a DNA library has been constructed in a particular vector, an 
appropriate host is transformed by well known techniques. Transformed colonies 
are plated at a density of approximately 200-300 colonies per petri dish. Colonies 
are then screened for the expression of a heat stable DNA polymerase by 
transferring transformed E. coli colonies to nitrocellulose membranes. After the 
transferred cells are grown on nitrocellulose (approximately 12 hours), the cells 

. . , ■ i o^otW tr^H stQ'iTfnr 

are lysed by standard lecnniques. anu mc manuia.^ - - 

5 minutes to inactivate the endogenous E. coli enzyme. Other temperatures may 
be used to inactivate the host polymerases depending on the host used and the 
temperature stability of the DNA polymerase to be cloned. Stable DNA 
polymerase activity is then detected by assaying for the presence of DNA 
polymerase activity using well known techniques. Sagner et ai. Gene 
97:1 19-123 (1991). which is hereby incorporated by reference in its entirety. The 
gene encoding a DNA polymerase of the present invention can be cloned using 
the procedure described by Sagner et ai. supra. 

The recombinant host containing the wild type gene encoding Tne DNA 
polymerase. E. coli DH10B (pUC-Tne), was deposited on September 30, 1994. 
with the Agricultural Research Culture Collection (NRRL), 1815 N. University 
Street. Peoria. IL 61604 USA as Deposit No. NRRL B-21338. The gene 
encoding Tma DNA polymerase has also been cloned and sequenced (U.S. Patent 
5,374,553, which is expressly incorporated by reference in its entirety). 

If the Thermotoga (e.g., Tne or Tma) DNA polymerase has 3'-to-5' 
exonuclease activity, this activity may be reduced, substantially reduced, or 
eliminated by mutating the DNA polymerase gene. Such mutations include point 
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techniques well known in the art (Sambrook el al, (1989) in: Molecular Cloning, 
A Laboratory Manual (2nd Ed.), Cold Spring Harbor Laboratory Press, Cold 

Spring Harbor. N.Y.)- 

The 3'-to-5' exonuclease activity can be reduced or impaired by creating 
site specific mutants within the3'-5' exonuclease domain. See infra. In a 
specific embodiment of the invention Asp- 3 of Tne DNA polymerase (SEQ ID 
NO. 3) is changed to any amino acid, preferably to Ala 3 " to substantially reduce 
3'-to-5' exonuclease activity. In another specific embodiment of the invention. 
Asp 323 of Tma may be changed to any other amino add. preferably to Ala to 
substantially reduce 3'-to-5' exonuclease activity. 

The 5'-3' exonuclease activity of the DNA polymerase can be reduced 
or eliminated by mutating the DNA polymerase gene. Such mutations include 
point mutations, frame shift mutations, deletions, and insertions. Preferably, the 
region of the gene encoding the 5'-3' exonuclease activity is deleted using 
techniques well known in the art. In embodiments of this invention, any one of 
six conserved amino adds that are associated with the 5'-3' exonuclease activity 
ean be mutated. Examples of these conserved amino acids with respect to Tne 
DNA polymerase include Asp 8 , Glu" 2 , Asp 1 ' 4 , Asp" 5 , Asp 137 , and Asp' 39 . Other 
possible sites for mutation are: Gly 102 , Gly' 87 and Gly' 95 . 

The present invention is directed broadly to mutations of DNA 
polymerases that result in the reduction or elimination of 5'-3' exonuclease 
activity. Other particular mutations correspond to the following amino acids. 

E. coli poll: Asp' 3 , Glu" 3 , Asp" 5 , Asp" 6 , Asp 138 , and Asp 140 
Tag pol: Asp' 8 , Glu" 7 , Asp" 9 , Asp' 20 , Asp 142 , and Asp' 44 
Tma V ol: Asp 8 , Glu" 2 , Asp" 4 , Asp" 5 , Asp 137 , and Asp 139 
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Amino acid residues of Thermotoga maritime* (Tma) DNA polymerase are 
numbered as in U.S. Patent No. 5,374,553. 

By comparison to the amino acid sequence of other DNA polymerases, 
the corresponding sites can easily be located and the DNA mutanigized to prepare 
a coding sequence for the corresponding DNA polymerase which lacks the 5'-3' 
exonuclease activity. Examples of other DNA polymerases that can be so 
mutated include: 



Enzyme or source 


Mutation positions 


Streptococcus pneumoniae 


Asp 10 . Glu" 4 . Asp" 6 , Asp" 7 . Asp 139 . Asp' 41 


Thermus jlavus 


Asp 17 , Glu" 6 , Asp" 8 , Asp 119 , Asp 141 , Asp 1 " 


Thermus thermophilus 


Asp' 8 , Glu" 8 , Asp 120 , Asp 121 , Asp 143 , Asp 145 


Deinococcus radiodurans 


Asp' 8 , Glu" 7 , Asp 1 ", Asp 120 , Asp 142 , Asp 144 


Bacillus caldotenax 


Asp 9 , Glu 109 . Asp" 1 , Asp" 2 , Asp 134 , Asp 136 



Coordinates of S. pneumoniae, T. Jlavus. D. radiodurans, B. caldotenax 
were obtained from Gutman and Minton. Coordinates of T. thermophilus were 
obtained from International Patent No. WO 92/06200. 

To abolish the 5 '-3' exonuclease activity, amino acids are selected which 
have different properties. For example, an acidic amino acid such as Asp may be 
changed to a basic, neutral or polar but uncharged amino acid such as Lys, Arg, 
His (basic); Ala, Val. Leu, He, Pro, Met, Phe, Trp (neutral); or Gly, Ser, Thr, Cys, 
Tyr, Asn or Gin (polar but uncharged). Glu may be changed to Asp, Ala, Val 
Leu, He. Pro, Met, Phe, Trp, Gly, Ser, Thr, Cys, Tyr, Asn or Gin. Specifically, 
the Ala substitution in the corresponding position is expected to abolish 5 '-exo 
activity. 

Preferably, oligonucleotide directed mutagenesis is used to create the 



this technique involves annealing a oligonucleotide complementary (except for 
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25 



one or more mismatches) to a single stranded nucleotide sequence coding for the 
DNA polymerase of interest. The mismatched oligonucleotide is then extended 
by DNA polymerase, generating a double stranded DNA molecule which contains 
the desired change in sequence on one strand. The changes in sequence can of 
course result in the deletion, substitution, or insertion of an amino acid. The 
double stranded polynucleotide can then be inserted into an appropriate 
expression vector, and a mutant polypeptide can thus be produced. The above- 
described oligonucleotide directed mutagenesis can of course be carried out via 
PCR. 

. - . . . _ j :„ .ftk» FYWA 

In other embodiments, the entire y-y exonucieasc uuma». ^ ^ 

polymerase can be deleted by proteolytic cleavage or by genetic engineering. For 
example, a unique Sphl restriction site can be used to obtain a clone devoid of 
nucleotides encoding the 219 ammo terminal amino acids of Tne DNA 
polymerase. Examples of such a clone are pTTQTne535FY and P TTQTne5FY. 
Alternatively, less than the 219 amino terminal amino acids may be removed, for 
example, by treating the DNA coding for the Tne DNA polymerase with an 
exonuclease, isolating the fragments, ligating the fragments into a cloning 
vehicle, transfecting cells with the cloning vehicle, and screening the 
transformants for DNA polymerase activity and lack of 5-3' exonuclease 
activity, with no more than routine experimentation. 

Thermotoga DNA polymerase mutants can also be made to render the 
polymerase non-discriminating against non-natural nucleotides such as 
dideoxynucleotides. Changes within the O-helix of Thermotoga polymerases, 
such as other point mutations, deletions, and insertions, can be made to render the 
polymerase non-discriminating. By way of example, one Tne DNA polymerase 
mutant having this property substitutes a nonnatural amino acid such as Tyr for 
Phe at amino acid 67 as numbered in Figs. 5 A and 5B, and 730 of SEQ ID NO:3 . 

acids 722-7.^ oi SB.) 11) Nl):.> or amino acia. • ..uu.pcjlv ' 

5B. The O-helix may be defined as RXXXKXXXFXXXYX. wherein X is any 
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amino acid. The most important amino acids in conferring discriminatory activity 
include Arg, Lys and Phe. Amino acids which may be substituted for Arg at 
positions 722 are selected independently from Asp, Glu, Ala, Val Leu, He, Pro, 
Met, Phe, Trp, Gly, Ser, Thr, Cys, Tyr. Gin. Asn, Lys and His. Amino acids that 
may be substituted for Phe at position 730 include Lys, Arg, His, Asp, Glu, Ala, 
Val. Leu, He, Pro, Met, Trp, Gly, Ser. Thr, Cys, Tyr, Asn or Gin. Amino acids 
that may be substituted for Lys at position 726 of SEQ ID NO: 3 include Tyr, 
Arg, His, Asp, Glu, Ala, Val, Leu, lie, Pro, Met, Trp, Gly, Ser, Thr, Cys, Phe, 
Asnor Gin. Preferred mutants include Tyr 7 -, Ala™, Ser" 0 and Thr-. Such Tne 
mutants may be prepared by well known methods of site directed mutagenesis as 
described herein. See also Example 10. 

The corresponding mutants can also be prepared from Tma DNA 
polymerase, including Arg-, Lys" 6 and Phe" 0 . Most prefered mutants include 
Phe 730 to Tyr 730 , Ser 730 , Thr 730 and Ala 730 . 

2. Enhancing Expression of Thermotoga DNA Polymerase 

To optimize expression of the wild type or mutant Thermotoga DNA 
polymerases of the present invention, inducible or constitutive promoters are well 
known and may be used to express high levels of a polymerase structural gene in 
a recombinant host. Similarly, high copy number vectors, well known in the art, 
may be used to achieve high levels of expression. Vectors having an inducible 
high copy number may also be useful to enhance expression of Thermotoga DNA 
polymerase in a recombinant host. 

To express the desired structural gene in a prokaryotic cell (such as, 
E. coli, B. subtilis, Pseudomonas. etc.), it is necessary to operably link the desired 
structural gene to a functional prokaryotic promoter. However, the natural 
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be used to enhance expression and may either be constitutive or regulable (i.e., 
inducible or derepressible) promoters. Examples of constitutive promoters 
include the int promoter of bacteriophage X. and the bla promoter of the 
^lactamase gene of P BR322. Examples of inducible prokaryotic promoters 
include the major right and left promoters of bacteriophage X (P R and PJ, trp, 
recA lacZ, lad. tet, gal, trc, and too promoters of E call The B. subtilis 
promoters include a-amylase (Ulmanen et al, J. Bacterial 162:176-182 (1985)) 
and Bacillus bacteriophage promoters (Gryczan. T., In: The Molecular Biology 
Of Bacilli Academic Press. New York (1982)). Streptomyces promoters are 
described by Ward et al., Mol. Gen. Genet. 205:468478 (1986)). Prokaryotic 
promoters are also reviewed by Click. J. IruL Microbial. 1:277-282 (1987); 
Cenatiempto, Y., Biochimie 68:505-5 1 6 (1 986); and Gottesman, Ann. Rev. Genet. 
78:415-442 (1984). Expression in a prokaryotic cell also requires the presence 
of a ribosomal binding site upstream of the gene-encoding sequence. Such 
ribosomal binding sites are disclosed, for example, by Gold et al, Ann. Re, 
Microbiol. 35:365404 (1981). 

To enhance the expression of Thermotoga (e.g., The and Tma) DNA 
polymerase in a eukaryotic cell, well known eukaryouc promoters and hosts may 
be used. Preferably, however, enhanced expression of Thermotoga DNA 
polymerase is accomplished in a prokaryotic host. The preferred prokaryotic host 
for overexpressing this enzyme is E. coh. 

3. Isolation and Purification of Thermotoga DNA Polymerase 

The enzyme(s) of the present invention (Thermotoga DNA polymerases 
and mutants thereof) is preferably produced by fermentation of the recombinant 
host containing and expressing the cloned DNA polymerase gene. However, the 
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invention. Fragments of the polymerase are also included 
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invention. Such fragments include proteolytic fragments and fragments having 
polymerase activity. 

Any nutrient that can be assimilated by Thermotoga or a host containing 
the cloned Thermotoga DNA polymerase gene may be added to the culture 
5 medium. Optimal culture conditions should be selected case by case according 

to the strain used and the composition of the culture medium. Antibiotics may 
also be added to the growth media to insure maintenance of vector DNA 
containing the desired gene to be expressed. Culture conditions for Thermotoga 
neapolitana have, for example, been described by Huber et ah. Arch. Microbiol. 

10 144:324-333 (1986). Media formulations are also described in DSM or ATCC 

Catalogs and Sambrook et ai, In: Molecular Cloning, a Laboratory Manual (2nd 
ed.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). 

Thermotoga and recombinant host cells producing the DNA polymerase 
of this invention can be separated from liquid culture, for example, by 

15 centrifugation. In general, the collected microbial cells are dispersed in a suitable 

buffer, and then broken down by ultrasonic treatment or by other well known 
procedures to allow extraction of the enzymes by the buffer solution. After 
removal of cell debris by ultracentrifugation or centrifugation, the DNA 
polymerase can be purified by standard protein purification techniques such as 

20 extraction, precipitation, chromatography. affinity chromatography, 

electrophoresis or the like. Assays to detect the presence of the DNA polymerase 
during purification are well known in the art and can be used during conventional 
biochemical purification methods to determine the presence of these enzymes. 

4. Uses of Thermotoga DNA Polymerase 

25 The wild type and mutant Thermotoga DNA polymerases (e.g., Tma and 

Tne) of the present invention may be used in well known DNA sequencing. DNA 
labeling. DNA amplification and cDNA synthesis reactions. Thermotoga DNA 
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activity, devoid of or substantially reduced in 5'- 3' exonuclease activity, or 
containing one or mutations in the O-helix that make the enzyme 
nondiscriminatory for dNTPs and ddNTPs (e.g., a Phe™-+Tyr™ mutation of SEQ 
ID NO: 3) are especially useful for DNA sequencing, DNA labeling, and DNA 
amplification reactions and cDNA synthesis. Moreover, Thermotoga DNA 
polymerase mutants containing two or more of these properties are also especially 
useful for DNA sequencing, DNA labeling, DNA amplification or cDNA 
synthesis reactions. As is well known, sequencing reactions (isothermal DNA 
sequencing and cycle sequencing of DNA) require the use of DNA polymerases. 
Dideoxy-mediated sequencing involves the use of a chain-termination technique 
which uses a specific polymer for extension by DNA polymerase, a base-specific 
chain terminator and the use of polyacrylamide gels to separate the newly 
synthesized chain-terminated DNA molecules by size so that at least a part of the 
nucleotide sequence of the original DNA molecule can be determined. 
Specifically, a DNA molecule is sequenced by using four separate DNA sequence 
reactions, each of which contains different base-specific terminators. For 
example, the first reaction will contain a G-specific terminator, the second 
reaction will contain a T-specific terminator, the third reaction will contain an 
A-specific terminator, and a fourth reaction may contain a C-specific terminator. 
Preferred terminator nucleotides include dideoxyribonucleoside triphosphates 
(ddNTPs) such as ddATP. ddTTP. ddGTP, ddlTP and ddCTP. Analogs of 
dideoxyribonucleoside triphosphates may also be used and are well known in the 
art. 

When sequencing a DNA molecule, ddNTPs lack a hydroxyl residue at 
the 3' position of the deoxyribose base and thus, although they can be 
incorporated by DNA polymerases into the growing DNA chain, the absence of 
the 3 '-hydroxy residue prevents formation of the next phosphodiester bond 



competition between extension of the chain and base-spccihe 
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resulting in a population of synthesized DNA molecules which are shorter in 
length than the DNA template to be sequenced. By using four different ddNTPs 
in four separate enzymatic reactions, populations of the synthesized DNA 
molecules can be separated by size so that at least a part of the nucleotide 
sequence of the original DNA molecule can be determined. DNA sequencing by 
dideoxy-nucleotides is well known and is described by Sambrook et al, In: 
Molecular Cloning, a Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N.Y. ( 1 989). As will be readily recognized, the Thermotoga 
DNA polymerases and mutants thereof of the present invention may be used in 
such sequencing reactions. 

As is well known, detectably labeled nucleotides are typically included 
in sequencing reactions. Any number of labeled nucleotides can be used in 
sequencing (or labeling) reactions, including, but not limited to, radioactive 
isotopes, fluorescent labels, chemiluminescent labels, bioluminescent labels, and 
enzyme labels. It has been discovered that the wild type and mutant DNA 
polymerase of the present invention may be useful for incorporating aS 
nucleotides ([aSJdATP, [aS]dTTP, [aS]dCTP and [<xS]dGTP) during sequencing 
(or labeling) reactions. For example, [a»S]dATP, a commonly used detectably 
labeled nucleotide in sequencing reactions, is incorporated three times more 
efficiently with the Tne DNA polymerase of the present invention, than with Tag 
DNA polymerase. Thus, the enzyme of the present invention is particularly 
suited for sequencing or labeling DNA molecules with [a 35 S]dNTPs. 

Polymerase chain reaction (PCR), a well known DNA amplification 
technique, is a process by which DNA polymerase and deoxyribonucleoside 
triphosphates are used to amplify a target DNA template. In such PCR reactions, 
two primers, one complementary to the 3' termini (or near the 3'-termini) of the 
first strand of the DNA molecule to be amplified, and a second primer 
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deoxyribonucleoside triphosphates, allows the synthesis of a third DNA molecule 
complementary to the first strand and a fourth DNA molecule complementary to 
the second strand of the DNA molecule to be amplified. This synthesis results 
in two double stranded DNA molecules. Such double stranded DNA molecules 

5 may then be used as DNA templates for synthesis of additional DNA molecules 

by providing a DNA polymerase, primers, and deoxyribonucleoside 
triphosphates. As is well known, the additional synthesis is carried out by 
"cycling" the original reaction (with excess primers and deoxyribonucleoside 
triphosphates) allowing multiple denaturing and synthesis steps. Typically, 

10 denaturing oi aouoie siranueu uina niuicv-un-;. ^ ^i... ow. fa ^ 

templates is accomplished by high temperatures. The wild type and mutant 
Thermotoga DNA polymerases of the present invention are heat stable DNA 
polymerases, and thus will survive such thermal cycling during DNA 
amplification reactions. Thus, the wild type and mutant DNA polymerases of the 

15 invention are ideally suited for PCR reactions, particularly where high 

temperatures are used to denature the DNA molecules during amplification. 

The Thermotoga DNA polymerase and mutants of the present invention 
(e.g. Trie and Tma) may also be used to prepare cDNA from mRNA templates. 
See, U.S. Patent Nos. 5,405,776 and 5,244,797, the disclosures of which are 

20 explicitly incorporated by reference herein. Thus, the invention also relates to 

a method of preparing cDNA from mRNA, comprising 

(a) contacting mRNA with an oligo(dT) primer or other 
complementary primer to form a hybrid, and 

(b) contacting said hybrid formed in step (a) with the Thermotoga 
25 DNA polymerase or mutant of the invention and the four dNTPs, 

whereby a cDNA-RNA hybrid is obtained. 

If the reaction mixture is step (b) further comprises an appropriate 
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is also directed to a method of preparing dsDNA with the Thermotoga DNA 
polymerases and mutants thereof of the present invention. 



Kits 



The wild type and mutant Thermotoga DNA polymerases of the invention 
are suited for the preparation of a kit. Kits comprising the wild type or mutant 
DNA polymerase(s) may be used for detectably labeling DNA molecules. DNA 
sequencing, amplifying DNA molecules or cDNA synthesis by well known 
techniques, depending on the content of the kit. See U.S. Patent Nos. 4,962,020. 
5,173.411, 4,795.699, 5,498,523, 5.405,776 and 5,244,797. Such kits may 
comprise a carrying means being compartmentalized to receive in close 
confinement one or more container means such as vials, test tubes and the like. 
Each of such container means comprises components or a mixture of components 
needed to perform DNA sequencing, DNA labeling, DNA amplification, or 
cDNA synthesis. 

A kit for sequencing DNA may comprise a number of container means. 
A first container means may, for example, comprise a substantially purified 
sample of Thermotoga DNA polymerases or mutants thereof. A second container 
means may comprise one or a number of types of nucleotides needed to 
synthesize a DNA molecule complementary to DNA template.. A third container 
means may comprise one or a number of different types of dideoxynucleoside 
triphosphates. A fourth container means may comprise pyrophosphatase. In 
addition to the above container means, additional container means may be 
included in the kit which comprise one or a number of DNA primers. 

A kit used for amplifying DNA will comprise, for example, a first 
container means comprising a substantially pure mutant or wild type Thermotoga 
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Kits for cDNA synthesis will comprise a first container means containing 
the wild type or mutant Tne DNA polymerase of the invention, a second container 
means will contain the four dNTPs and the third container means will contain 
oligo(dT) primer. See U.S. Patent Nos. 5,405,776 and 5,244,797. Since the 
Thermotoga DNA polymerases of the present invention are also capable of 
preparing dsDNA, a fourth container means may contain an appropriate primer 
complementary to the first strand cDNA. 

Of course, it is also possible to combine one or more of these reagents in 
a single tube. A detailed description of such formulations at working 

• onniiMtinn entitled "Stable 

concentrations is aescnoeu in mc f<^" 1 "ff — — - 

Compositions for Nucleic Acid Amplification and Sequencing" filed on August 

14, 1996, which is expressly incorporated by reference herein in its entirety. 

When desired, the kit of the present invention may also include container 

means which comprise detectably labeled nucleotides which may be used during 

the synthesis or sequencing of a DNA molecule. One of a number of labels may 

be used to detect such nucleotides. Illustrative labels include, but are not limited 

to, radioactive isotopes, fluorescent labels, chemiluminescent labels, 

bioluminescent labels and enzyme labels. 

6. Advantages of the Thermotoga DNA Polymerase 

Thermotoga DNA polymerases of the invention have distinct advantages 
in DNA sequencing. For example, when using the Tne DNA polymerase mutants 
of the invention in single-extension sequencing, they generate strong, clear 35 S- 
labeled sequence, increase sequence signal to background ratio, generate *500 
bases of sequence, reduce false stops in the sequencing ladder, and permit high 
temperature sequencing reactions. The efficient »S incorporation by the Tne 

:oiu, eive biiarpci nana* man * % -> 

a longer shelf life than 52 P. further, the Tne polymerase mutants produce longer 
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sequence reads and gives more accurate sequence interpretation. In addition, the 
use of a 70°C reaction temperature with this thermophilic polymerase increases 
sequencing efficiency of structure-containing and GC-rich templates. 

Compared to modified T7 DNA polymerase (Sequenase™), Tne DNA 
polymerase mutants allow improved sequencing efficiency of structure containing 
and GC-rich templates, are more forgiving in incubation times for labeling and 
extensions, and allow one to obtain full length sequence from one-tenth the 
amount of template. With regard to other polymerases, the Tne DNA polymerase 
mutants provide, under appropriate reaction conditions, more even band 

~ r»oHc ovhihit nn WPak Or 

intensities and give longer, more accuiaic ^ 4 uu.^ — - ~ - 

absent "dropout" bands, exhibit improved sequencing efficiency of structure 
containing and GC-rich templates, exhibit no sequence artifacts from templates 
containing homopolymers. and provide for shorter film exposure and/or less 
template input due to the efficient 35 S-dNTP incorporation. 

With regard to cycle sequencing, the Tne DNA polymerase mutants 
generate strong, clear 35 S-labeled sequence, they increase sequence signal to 
background ratio, generate *500 bases of sequence, reduce false stops in the 
sequencing ladder under appropriate conditions, and permit high temperature 
reactions. The Tne DNA polymerase mutants also allow for highly efficient 35 S 
dATP incorporation and therefore shorter film exposures and/or less template 
input, give sharper bands than 32 P, give off lower energy radiation than 32 P and 
have a longer shelf life than 32 P. The Tne DNA polymerase mutants also produce 
longer sequence reads and give more accurate sequence interpretation. 32 P end 
labeling of primers generates data with less background from less pure DNA and 
requires as little as 5 fmole (0.01 ug) of DNA. 

With regard to cycle sequencing, compared to the mutant Taq DNA 
polymerase (Thermo Sequenase™), the Tne DNA polymerase mutants generate 

.-.lep, require no special pi lines ucsign ..it»cu;.. 

of PCR products directly using any primer. Compared iu bequilhcmv ', tne 
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mutants of Tne DNA polymerase generate three times stronger 35 S-labeled 
sequence, give more even band intensities, gives longer and more accurate 
sequence reads, require less template and less primer, and give no sequence 
artifacts from templates containing homopolymers. Compared to various other 
polymerases (e.g. Tth DNA polymerase), the Tne DNA polymerase mutants 
under appropriate reaction conditions generate three times stronger 35 S-labeled 
sequence, give more even band intensities, give longer and more accurate 
sequence reads, give no weak or absent "dropout" bands, improve sequencing 
efficiency of structure-containing and GC-nch templates, and reduce false stops 
in sequencing ladders, including through homopoiymer regions. 

With regard to fluorescent sequencing, the mutants of Tne DNA 
polymerase readily accept dye primers and dye terminators, increase sequence 
signal to background ratio, produce fewer ambiguous calls, and generate *500 
bases of sequence. The Tne DNA polymerase mutants also produce longer 
sequence read lengths, give more accurate sequence interpretation, and allow for 
quantitation of bases in heterologous mixtures. Since the Tne DNA polymerase 
mutants provide for good incorporation of dye terminators, such dye terminators 
can be reduced 500-fold. Further, increased signal improves bases calling, 
reduces cost and time to sequence, eliminates the need to remove excess dye 
terminators before gel loading, and produces more even band intensities. The 
efficient use of dye primers generates data with less background from impure 
DNA and requires as little as 0.6 ug of dsDNA (double-stranded DNA). 

With regard to the use of Thermo Sequenase™ and AmpliTaq FS™ in 
fluorescent sequencing, the Tne DNA polymerase mutants provide more even 
band intensities in dye terminator sequencing and give comparable results with 
dye primers. With regard to SequiTherm™, the Tne DNA polymerase mutants 
oive more even band intensities that give longer, more accurate sequencing reads 



:-nminale i>o>i reaction clean up u , . iviiiiuum ..... 

and allow for quantitation of bases in heterologous mixtures using dye primer,. 
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With regard to the use of various other enzymes in fluorescent 
sequencing, such as AmpliTaq™ and AmpliTaqCS ™ mutant Tne DNA 
polymerases under appropriate reaction conditions provide more even band 
intensities and more accurate sequence reads with both dye terminators and dye 

5 primers, give no weak or absent "dropout" bands, have lower background and 

fewer false stops, use 500-fold less dye terminators, eliminate post reaction clean 
up of dye terminators, require 10-fold less template, and allow for quantitation of 
bases in heterologous mixtures. 

As shown in Fig. 3, Tne DNA polymerase incorporates a-thio dATP at 

10 three times the rate of Tag DNA polymerase. However, surprisingly, when u-thio 

dATP is used in place of dATP in sequencing reactions using [a- 35 S]dATP and 
mutants of Tne DNA polymerase, the resulting sequencing band signal intensity 
is increased by approximately 8-10 fold. The weak signal seen when dATP is 
used reflects the mutant DNA polymerase's strong preference for incorporating 

15 dATP over a-thio dATP from a mixed pool. Attempts to improve signal intensity 

by merely decreasing the amount of dATP resulted in very poor quality sequence 
with many false stops. Parallel experiments with [a- 32 P]dATP and low 
concentrations of dATP produced similar poor quality sequence, indicating that 
the nucleotide concentration imbalance was causing the enzyme to perform 

20 poorly. By using a-thio dATP mixed with [a- 35 S]dATP. the four nucleotide 

concentrations kept constant without diminishing signal or sequence quality. 

Having now generally described the invention, the same will be more 
readily understood through reference to the following Examples which are 
provided by way of illustration, and are not intended to be limiting of the present 
25 invention, unless specified. 
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Example 1: Bacterial Strains And Growth Conditions 

Thermotoga neapolitana DSM No. 5068 was grown under anaerobic 
conditions as described in the DSM catalog (addition of resazurin. Na 2 S, and 
sulfur granules while sparging the media with nitrogen) at 85 °C in an oil bath 
5 from 12 to 24 hours. The cells were harvested by filtering the broth through 

Whatman #1 filter paper. The supernatant was collected in an ice bath and then 
centrifuged in a refrigerated centrifuge at 8,000 rpms for twenty minutes. The 
cell paste was stored at -70 °C prior to total genomic DNA isolation. 

E. coli strains were grown in 2X LB broth base (Lennox L broth base: 

10 GIBCO/BRL) medium. Transformed cells were incubated in SOC (2% tryptone, 

0.5% yeast extract, yeast 10 mM NaCl, 2.5 mM KC1, 20mM glucose. lOmM 
MgCl 2 , and lOmM MgS0 4 per liter) before plating. When appropriate antibiotic 
supplements were 20 mg/1 tetracycline and 100 mg/1 ampicillin. E. coli strain 
DH10B (Lorow et al.. Focus 12:19-20 (1990)) was used as host strain. 

15 Competent DH10B may be obtained from Life Technologies, Inc. (LTI) 

(Gaithersburg, MD). 



Example 2: DNA Isolation 



Thermotoga neapolitana chromosomal DNA was isolated from l.lg of 
cells by suspending the cells in 2.5 ml TNE (50mM Tris-HCl, pH 8.0, 50mM 
20 NaCl, lOmM EDTA) and treated with 1% SDS for 10 minutes at 37°C. DNA 

was extracted with phenol by gently rocking the lysed cells overnight at 4°C. 
The next day, the lysed cells were extracted with chloroform:isoamyl alcohol. 
The resulting chromosomal DNA was further purified by centrifugation in a CsCl 
i.,., nrrsHirTit rhromosnmnl DNA isolated from the densitv eradient was 



containing lOmM I ris-I K '1 ( pi I X ( ! ) an.i 1 mM LDTA (IT 
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Example 3: Construction of Genomic Libraries 

The chromosomal DNA isolated in Example 2 was used to construct a 
genomic library in the plasmid pCP13. Briefly, 10 tubes each containing 10„g 
of Thermotoga neapolitana chromosomal DNA was digested with 0.01 to 10 
units of Sau3Al for 1 hour at 37°C. A portion of the digested DNA was tested 
in an agarose (1.2%) gel to determine the extent of digestion. Samples with less 
than 50% digestion were pooled, ethanol precipitated and dissolved in TE. 6.5 
Mg of partially digested chromosomal DNA was ligated into 1.5 of P CP13 
cosmid which had been digested with BamHl restriction enUonucleasc and 
dephosphorylated with calf intestinal alkaline phosphatase. Ligation of the 
partially digested Thermoioga DNA and BamHl cleaved P CP13 was carried out 
with T4 DNA ligase at 22°C for 16 hours. After ligation, about l^g of ligated 
DNA was packaged using A-packaging extract (obtained from Life Technologies, 
Inc., Gaithersburg, MD). DH10B cells (Life Tech. Inc.) were then infected with 
100 & of the packaged material. The infected cells were plated on tetracycline 
containing plates. Serial dilutions were made so that approximately 200 to 300 
tetracycline resistant colonies were obtained per plate. 

Example 4: Screening for Clones Expressing Thermotoga 
neapolitana DNA Polymerase 

Identification of the Thermotoga neapolitana DNA polymerase gene of 
the invention was cloned using the method of Sagner et al. Gene 97:1 19-123 
(1991) which reference is herein incorporated in its entirety. Briefly, the E. coli 
tetracycline resistant colonies from Example 3 were transferred to nitrocellulose 
membranes and allowed to grow for 12 hours. The cells were then lysed with the 



room temperature, i lie memorane* were men ireaieu ai •'. 

inactivate the endogenous E. coli enzymes. Surviv.ng DNA polymerase acuvn; 
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was detected by submerging the membranes in 15 ml of polymerase reaction mix 
(50 mM Tris-HCl (pH 8.8), 1 raM MgCl 2 , 3 mM p-mercaptoethanol, 10 fjM 
dCTP, dGTP, dTTP, and 15 /zCi of 3,000 Ci/mmol [a 32 P]dATP) for 30 minutes 
at65°C. 

Using autoradiography, three colonies were identified that expressed a 
Thermotoga neapolitana DNA polymerase. The cells were grown in liquid 
culture and the protein extract was made by sonication. The presence of the 
cloned thermostable polymerase was confirmed by treatment at 90°C followed 
by measurement of DNA polymerase activity at 72°C by incorporation of 

_i : +---u~ — woo ir.tr> cr\r\ insoluble DNA. One of 

radioactive aeoxyriDuimuicuMuc uipiwp.u^ "» 

the clones, expressing Tne DNA polymerase, contained a plasmid designated 
pCP 13-32 and was used for further study. 

Example 5: Subcloning of Tne DNA polymerase 

Since the pCP13-32 clone expressing the Tne DNA polymerase gene 
contains about 25 kb of T. neapolitana DNA, subcloning a smaller fragment of 
the Tne polymerase gene was attempted. The molecular weight of the Tne DNA 
polymerase purified from E. co/// P CP13-32 was about 100 kd. Therefore, a 
2.5-3.0 kb DNA fragment will be sufficient to code for full-length polymerase. 
A second round of Saul A partial digestion similar to Example 3 was done using 
pCP13-32 DNA. In this case, a 3.5 kb region was cut out from the agarose gel, 
purified by Gene Clean (BIO 101, La Jolla, CA) and ligated into plasmid 
pSport 1 (Life Technologies, Inc.) which had been linearized with BamUl and 
dephosphorylated with calf intestinal alkaline phosphatase. After ligation, 
DH10B was transformed and colonies were tested for DNA polymerase activity 
as described in Example 4. Several clones were identified that expressed Tne 
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Fig. 4. Further, a 2.7 Kb Hindlll-Sstl fragment was subcloned into pUC19 to 
generate pUC19-7>ie. E. coli/pVC\9-Tne also produced Tne DNA polymerase. 

The Tne polymerase clone was sequenced by methods known in the art. 
The nucleotide sequence obtained of the 5' end prior to the start ATG is shown 
in SEQ ID NO:l. The nucleotide sequence obtained which encodes carboxy- 
terminal region of the Tne polymerase is shown in Figs. 5A and 5B (SEQ ID 
NO: 17). When SEQ ID NO: 17 is translated it does not produce the entire amino 
acid sequence of the Tne polymerase due to frame shift errors generated during 
the determination of the nucleotide sequence. However, an amino acid sequence 

- . _ . u,. tronc-iotino all three reading frames of 

ot tne me polymerase was uuiamtu ur n^^.— . & — — 

SEQ ID NO: 17, comparing these sequences with known polymerase amino acid 
sequences, and splicing the Tne polymerase sequence together to form the amino 
acid sequence set forth in SEQ ID NO: 18. The complete nucleotide sequence 
coding for Tne is shown in SEQ ID NO:2 and the complete amino acid sequence 
is shown in SEQ ID NO:3. 

SEQ ID NO:3 shows that the Tne sequence has an N-terminal methionine. 
It is not known with certainty whether the wild type Tne protein comprises an N- 
terminal methionine. It is possible to remove this N-terminal methionine 
according to methods well known to those of ordinary skill in the art, e.g. with 
a methionine amino peptidase. 

Example 6: Purification ofThermotoga neapolitana DNA 
Polymerase from E. coli 

Twelve grams of E. coli cells expressing cloned Tne DNA polymerase 
(DH10B/pSport-7mO were lysed by sonication (four thirty-second bursts with a 
medium tip at the setting of nine with a Heat Systems Ultrasonics Inc., model 375 



The sonicated extract was neatea at au 



WO 97/09451 



PCT/US96/14189 



-37- 



5 min. 50 mM KC1 and PEI (0.4%) was added to remove nucleic acids. The 
extract was centrifuged for clarification. Ammonium sulfate was added to 60%, 
the pellet was collected by centrifugation and resuspended in 10 ml of column 
buffer (25 mM Tris-HCl (pH 7.4), 8% glycerol, 0.5% EDTA, 5mM 
2-mercaptoethanol, 10 mM KC1). A Blue-Sepharose (Pharmacia) column, or 
preferably a Toso heparin (Tosohaas) column, was washed with 7 column 
volumes of column buffer and eluted with a 1 5 column volume gradient of buffer 
from lOmM to 2 M KC1. Fractions containing polymerase activity were pooled. 
The fractions were dialyzed against 20 volumes of column buffer. The pooled 

.. . rrr^r^ - . 1 CT^cr^U^^\ TVl P mlllTTin WAS 

fractions were applied to a iosoojuv milium v i^ww^,. 

washed to baseline OD 3S0 and elution effected with a linear 10 column volume 
gradient of 25 mM Tris (pH 7.4), 8% glycerol, 0.5 mM EDTA, 10 mM KC1, 5 
mM p-mercaptoethanol to the same buffer plus 650 mM KC1. Active fractions 
were pooled. 

Example 7: Characterization of Purified Tne DNA 
Polymerase 

1. Determination of the Molecular Weight of Thermotoga neapolitana 
DNA Polymerase 

The molecular weight of 100 kilodaltons was determined by 
electrophoresis in a 12.5% SDS gel by the method of Laemmli, U.K., Nature 
(Lond.) 227:680-685 (1970). Proteins were detected by staining with Coomassie 
brilliant blue. A 10 kd protein ladder (Life Technologies, Inc.) was used as a 
standard. 
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2. Method for Measuring Incorporation of [a 35 S]-dATP Relative to 
3 H-dA TP 

Incorporation of [aS]dATP was evaluated in a final volume of 500 fA of 
reaction mix, which was preincubated at 72°C for five minutes, containing either 
a [ 3 H]TTP nucleotide cocktail (100 uM each TTP, dATP, dCTP. dGTP with 
[ 3 H]TTP at 90.3 cpm/pmol), a nucleotide cocktail containing [aS]dATP as the 
only source of dATP (100 uM each [aSJdATP, dCTP, dGTP. TTP with 
[ a 35 S]dATP at 235 cpm/pmol). or a mixed cocktail (50 uM [aS]dATP, 50 uM 
dATP, 100 uM TTP ; 100 uM dCTP. 100 uM dGTP with [ 35 <xS] dATP at 118 
cpm/pmol and [ 3 H]TTP at 45.2 cpm/pmol) and 50 mM bicine, pH 8.5. 30 mM 
MgCU, 0.25 mg/ml activated salmon sperm DNA, 20% glycerol. The reaction 
was initiated by the addition of 0.3 units of T. neapolitana DNA polymerase or 
T. aquaticus DNA polymerase. At the times indicated a 25 y\ aliquot was 
removed and quenched by addition of ice cold EDTA to a final concentration of 
83 mM. 20 //l aliquots of the quenched reaction samples were spotted onto GF/C 
filters. Rates of incorporation were compared and expressed as a ratio of 
T. neapolitana to T. aquaticus. The incorporation of [a 35 S]dATP by 
T. neapolitana DNA polymerase was three-fold higher than that of T. aquaticus 
DNA polymerase. 

Example 8: Reverse Transcriptase Activity 

(A) n :(dT), 2 . lg is the synthetic template primer used most frequently to 
assay for reverse transcriptase activity of DNA polymerases. It is not specific for 
retroviral-like reverse transcriptase, however, being copied by many prokaryotic 
and eukaryotic DNA polymerases (Modak and Marcus, J. Biol. Chem. 252:\ 1-19 



llular, rephcauve DNA polymerases i 
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efficiently in me presence ofMg~(Modak and Marcus, J. Biol. Chem. 252:11-19 
(1977); Gerard ei al., Biochem. 73:1632-1641 (1974); Spadari and Weissbach, J. 
Biol Chem. 2^9:5809-5815 (1974)). In contrast most cellular, replicative DNA 
polymerases do not copy the synthetic template primer (C) n :(dG) 12 . I8 efficiently 
in presence of either Mn~ or Mg", but retroviral reverse transcriptases do. 
Therefore, in testing for the reverse transcriptase activity of a DNA polymerase 
with synthetic template primers, the stringency of the test increases in the 
following manner from least to most stringent: ( A) n :(dT) 12 ., g (Mn") < 
(A) B :(dT) 12 .„ (Mg~) « (C) n :(dG), 2 . 18 (Mn~) < (C) n :(dG) 12 .„ (Mg"). 

The reverse transcriptase activity of Twer DNA polymerase was compared 
with Thermus thermophilus (Tth) DNA polymerase utilizing both (A) n :(dT) 20 and 
(C) n :(dG) 12 , 8 . Reaction mixtures (50 ul) with (A) n :(dT) 20 contained 50 mM Tris- 
HCl (pH 8.4), 100 uM (A) B , 100 uM (dT) 20 , and either 40 mM KC1, 6 mM 
MgCl 2 , 10 mM dithiothreitol. and 500 uM [ 3 H]dTTP (85 cpm/pmole), or 100 mM 
KC1, 1 mM MnCl 2 . and 200 uM [ 3 H]dTTP (92 cpm/pmole). Reaction mixtures 
(50 ul) with (C) B :(dG) 12 ,. contained 50 mM Tris-HCl (pH 8.4), 60 uM (C) B , 24 
uM (dG) 12 , 8 , and either 50 mM KC1. 10 mM MgCU 10 mM dithiothreitol, and 
100 uM [ 3 H]dGTP (132 cpm/pmole). or 100 mM KC1. 0.5 mM MnCk and 200 
uM [ 3 H]dGTP (107 cpm/pmole). Reaction mixtures also contained either 2.5 
units of the Tth DNA polymerase (Perkin-Elmer) or 2.5 units of the Tne DNA 
polymerase. Incubations were at 45°C for 10 min followed by 75°C for 20 min. 

The table shows the results of determining the relative levels of 
incorporation of Tne and Tth DNA polymerase with (A) B :(dT) 20 and (C) n :(dG), 2 , g 
in the presence of Mg" and Mn". Tne DNA polymerase appears to be a better 
reverse transcriptase than Tth DNA polymerase under reaction conditions more 
specific for reverse transcriptase, i.e., in the presence of (A) B :(dT) 20 with Mg« and 
(C) n :(dG), ? . lg with Mn" or Mg". 
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DNA Polymerase Activity of Tth and Tne 
DNA Polymerase with (A) n :(dT) 20 and (C) n :(dG), 2 .,» 



5 





DNA Polymerase Activity 




(pMoles Complementary 


[ 3 H]dNTP Incorporated) 


Enzyme 


(A) n :(dT) 20 


(C)„:(dG) 




Mg~ Mn** 


Me" Mn" 


Tne 


161.8 188.7 


0.6 4.2 


Tth 


44.8 541.8 


0 0.9 



Example 9: Construction of Thermotoga Neapolitana 3'-to-5' 
Exonuclease Mutant 



was 



The amino acid sequence of portions of the Tne DNA polymerase 
compared with other known DNA polymerases such as E. coli DNA 
polymerase 1, Taq DNA polymerase, T5 DNA polymerase, and T7 DNA 
polymerase to localize the regions of 3 '-to-5' exonuclease activity, and the dNTP 
binding domains within the DNA polymerase. One of the 3 '-to-5' exonuclease 
domains was determined based on the comparison of the amino acid sequences 
of various DNA polymerases (Blanco. L., et al. Gene 112: 139-144 (1992); 
Braithwaite and Ito. Nucleic Acids Res. 21 : 787-802 (1993)) is as follows: 



Tne 318 PSFALD'LETSS 328 (SEQ ID NO: 4) 
Poll 350 PVFAFDTETDS 360 (SEQ ID NO:5: Braithwaite and Ito, 
supra) 

T5 133 GPVAFDSETSA 143 (SEQ ID NO:6; Braithwaite and Ito, 

supra) 

t-7 i AtlVSniFANA 10 (SEQ ID NO:7: Braithwaite and Ito, 



WO 97/09451 



PCT/US96/14189 



-41- 



As a first step to make the Tne DNA polymerase devoid of 3'-5' 
exonuclease activity, a 2kb Sph fragment from pSport-Tne was cloned into 
M13mpl9 (LTI, Gaithersburg, MD). The recombinant clone was selected in£. 
coli DH5aF'IQ (LTI, Gaithersburg, MD). One of the clones with the proper 
5 insert was used to isolate uracilated single-stranded DNA by infecting E. coli 

CJ236 (Biorad, California) with the phage particle obtained from E. coli 
DH5aF'IQ. An oligonucleotide, GA CGT TTC AAG CGC TAG GGC AAA 
AG A (SEQ ID NO: 8) was used to perform site directed mutagenesis. This site- 
directed mutagenesis converted Asp 323 (indicated as * above) to Ala 323 . An 
10 £co47IIl restriction site was created as part of this mutagenesis to facilitate 

screening of the mutant following mutagenesis. The mutagenesis was performed 
using a protocol as described in the Biorad manual (1987) except T7 DNA 
polymerase was used instead of T4 DNA polymerase (USB, Cleveland, OH). 
The mutant clones were screened for the £co47III restriction site that was created 
15 in the mutagenic oligonucleotide. One of the mutants having the created 

£co47III restriction site was used for further study. The mutation Asp 3:3 to Ala 323 
has been confirmed by DNA sequencing. 

To incorporate the 3'-to-5' exonuclease mutation in an expression vector, 
the mutant phage was digested with Sphl and Hmdlll. A 2 kb fragment 
20 containing the mutation was isolated. This fragment was cloned in pUC-Tne to 

replace the wild type fragment. See Figure 6A. The desired clone. pUC-Tne 
(3'-5'), was isolated. The presence of the mutant sequence was confirmed by the 
presence of the unique Eco47lll site. The plasmid was then digested with Sstl 
and Hindlll. The entire mutant polymerase gene (2.6 kb) was purified and cloned 
25 into Sstl and Hindlll digested P Trc99 expression vector (Pharmacia. Sweden). 

The clones were selected in DH10B (LTI, Gaithersburg, MD). The resulting 
plasmid was designated P TrcTne35. See Figure 6B. This clone produced active 
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Example 10: Phenylalanine to Tyrosine Mutant 

As discussed supra, the polymerase active site including the dNTP 
binding domain is usually present at the carboxyl terminal region of the 
polymerase. The sequence of the Tne polymerase gene suggests that the amino 
acids that presumably contact and interact with the dNTPs are present within the 
694 bases starting at the internal BamHl site. See Figure 4 and Figs. 5A and 5B. 
This conclusion is based on homology with a prototype polymerase E coli DNA 
polymerase 1. See Polisky et al.. J. Biol. Chem. 265:14579-14591 (1990). The 
sequence of the carboxyl terminal portion of the polymerase gene is shown m 
Figs. 5A and 5B. Based upon this sequence, it is possible to compare the amino 
acid sequence within the O-helix for various polymerases. The complete 
sequence of the DNA polymerase is shown in SEQ ID NO:3. The corresponding 
O-helix region band on the sequence in Figs. 5A and 5B includes ammo acids 59 
to 72. 

Tne 722 RRVGKMVNFSIIYG 735 (SEQ ID NO:9) 

Poll 754 RRSAKAINFGLIYG 767 ( SEQIDNO:10) 

T5 562 RQAAKAITFGILYG 575 (SEQIDNO:ll) 

T7 518 RDNAKTFIYGFLYG 531 (SEQIDNO:12) 

Tag 659 RRAAKTINFGVLYG 672 (SEQ ID NO:13) 

It was shown that by replacing the phenylalanine residue of Tag DNA 
polymerase, the polymerase becomes non-discriminating against non-natural 
nucleotides such as dideoxynucleotides. See application Serial No. 08/525,087 
entitled "Mutant DNA Polymerases and Use Thereof of Deb K. Chatterjee, filed 
September R. specifically incorporated herein by reference. The mutation 

in place ot the phenylalanine, ana 17 DNA polymerase i, nv<ii-viS.>.n..i,nuun>- 
against dideoxynucleotides. The corresponding residue, Phe 7 « of E. coli Poll is 
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an amino acid that directly interacts with nucleotides. (Joyce and Steitz, Ann. 
Rev. Biochem. 63:777-822 (1994); Astake, M.J. .J. Biol. Chem. 270:1945-1954 
( 1 995)). A similar mutant of Tne DN A polymerase was prepared. 

In order to change Phe 730 of the Tne polymerase to a Tyr 730 as numbered 

5 in SEQ ID NO:3, site directed mutagenesis was performed using the 

oligonucleotide GTA TAT TAT AGA GTA GTT AAC CAT CTT TCC A. (SEQ 
ID NO: 14). As. part of this oligonucleotide directed mutagenesis, a Hpal 
restriction site was created in order to screen mutants easily. The same uracilated 
single-stranded DNA and mutagenesis procedure described in Example 9 were 

10 used for this mutagenesis. Following mutagenesis, the muianis weie screened for 

the Hpal site. Mutants with the desired Hpal site were used for further study. 
The mutation has been confirmed by DNA sequencing. 

The Phe 750 to Tyr 730 mutation was incorporated into pUC-Tne by replacing 
the wild type Sphl -Hindlll fragment with the mutant fragment obtained from the 

15 mutant phage DNA. The presence of the desired clone, pUC-TneFY, was 

confirmed by the presence of the unique Hpal site, see Figure 6A. The entire 
mutant polymerase gene was subcloned into pTrc99 as an Sstl-Hindlll fragment 
as described above in DH10B. The resulting plasmid was designated 
pTrcTneFY. (Figure 6B). The clone produced active heat stable polymerase. 

20 Example 11: 3'-to-5' Exonuclease and Phe 730 -+ Tyr 730 

Double Mutants 

In order to introduce the 3 '-5' exonuclease mutation and the 
Phe 730 ->Tyr 730 mutation in the same expression vector, pTrc99, it was necessary 
to first reconstitute both mutations in the pUC-Tne clone. See Figure 7. Both the 
25 pUC-Tne (3'-5') and the pUC-TneFY were digested with BamUl. The digested 

BamHl fragment (4.4 kb) was purified from pUC-Tne (3'-5') digested DNA and 
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the smallest BamKl fragment (0.8 kb) containing the Phe™-Tyr™ mutation was 
purified and ligated to generate P UC-Tne35FY. The proper orientation and the 
presence of both mutations in the same plasmid was confirmed by £co47III, 
Hpa\, and Sph\-Hind\\\ restriction digests. See Figure 7. 

The entire polymerase containing both mutations was subcloned as a Sstl- 
HindHI fragment in P Trc99 to generate pTrcTne35FY in DH10B. The clone 
produced active heat stable polymerase. 

Example 12: 3' -to- 5' Exonuclease, 5'- to-3' Exonuclease, 

i r»i. -7?fl v T-..„7J0 T\.;,,/^ A/ftiinvtlv 

ana rne r iy J"/" 1 - 1 "" " 

In most of the known polymerases, the 5 '-to-3' exonuclease activity is 
present at the amino terminal region of the polymerase (Ollis. D.L., et al, Nature 
313, 762-766, 1985; Freemont, P.S., et al. Proteins 1, 66-73. 1986; Joyce, CM., 
Curr. Opin. Struct. Biol. 7: 123-129 (1991). There are some conserved amino 
acids that are implicated to be responsible for 5 '-to-3' exonuclease activity 
(Gutman and Minton, Nucl Acids Res. 21. 4406-4407, 1993). See supra. It is 
known that 5 '-to-3' exonuclease domain is dispensable. The best known example 
is the Klenow fragment of E. coli Pol I. The Klenow fragment is a natural 
proteolytic fragment devoid of 5'-to-3' exonuclease activity (Joyce, CM., et al., 
J. Biol. Chem. 257. 1958-1964, 1990). In order to generate an equivalent mutant 
for Tne DNA polymerase devoid of 5 '-to-3' exonuclease activity, the presence 
of a unique Sph\ site present 680 bases from the Sstl site was exploited. 
P UC-Tne35FY was digested with HindlW, filled-in with Klenow fragment to 
generate a blunt-end, and digested with Sphl. The 1.9 kb fragment was cloned 
into an expression vector pTTQ19 (Stark, M.J.R.. Gene 51, 255-267, 1987) at the 
Sphl-Smal sites and was introduced into DH10B. This cloning strategy generated 



DNA polymerase. This clone is designated as pi l"gIne535FY. ihe ciun 
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produced active heat stable polymerase. No exonuclease activity could be 
detected in the mutant polymerase as evidenced by lack of presence of unusual 
sequence ladders in the sequencing reaction. This particular mutant polymerase 
is highly suitable for DNA sequencing. 

Example 13: 5'-to-3' Exonuclease Deletion and Phe 730 -Tyr 730 
Substitution Mutant 

In order to generate the 5'-3' exonuclease deletion mutant of the Tne 
DNA polymerase Phe 730 - 1 yr 730 mutant, the i.8 kb Sphl-Spel fragment of 
P TTQTne535FY was replaced with the identical fragment of pUC-Tne FY. See 
Fig. 8. A resulting clone, pTTQTne5FY, produced active heat stable DNA 
polymerase. As measured by the rate of degradation of a labeled primer, this 
mutant has a modulated, low but detectable, 3'-5' exonuclease activity compared 
to wild type Tne DNA polymerase. M13/pUC Forward 23-Base Sequencing 
Primer™, obtainable from LTI. Gaithersburg, MD, was labeled at the 5' end with 
[P 32 ] ATP and T4 kinase, also obtainable from LTI. Gaithersburg, MD, as 
described by the manufacturer. The reaction mixtures contained 20 units of either 
wild-type or mutant Tne DNA polymerase. 0.25 pmol of labeled primer, 20 mM 
tricine, pH 8.7, 85 mM potassium acetate, 1.2 mM magnesium acetate, and 8% 
glycerol. Incubation was carried out at 70°C. At various time points, 10 ul 
aliquots were removed to 5 ul cycle sequencing stop solution and were resolved 
in a 6 % polyacrylamide sequencing gel followed by andoradiography. While the 
wild-type polymerase degraded the primer in 5 to 15 minutes, it took the mutant 
polymerase more than 60 minutes for the same amount of degradation of the 
primer. Preliminary results suggest that this mutant polymerase is able to amplify 
more than 12 kb of genomic DNA when used in conjunction with Tag DNA 
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Example 14: Purification of the Mutant Polymerases 

The purification of the mutant polymerases was done essentially as 
described in U.S. Patent Application Serial No. 08/370,190, filed January 9. 
1995, entitled "Cloned DNA Polymerases for Thermotoga neapolitana," and as 
in Example 6, supra, with minor modifications. Specifically, 5 to 10 grams of 
cells expressing cloned mutant Tne DNA polymerase were lysed by sonication 
with a Heat Systems Ultrasonic, Inc. Model 375 machine in a sonication buffer 
comprising 50 mM Tris-HCl (pH 7.4); 8% glycerol; 5 mM 2-mercaptoethanoL 
10 mM NaCl, 1 mM EDTA. and 0.5 mM PMSF. The sonication sample was 
heated at 75°C for 15 minutes. Following heat treatment, 200 mM NaCl and 
0.4% PEI was added to remove nucleic acids. The extract was centrifuged for 
clarification. Ammonium sulfate was added to 48%, the pellet was resuspended 
in a column buffer consisting of 25 mM Tris-HCl (pH 7.4); 8% glycerol; 0.5% 
EDTA; 5 mM 2-mercaptoethanol; 10 mM KC1 and loaded on a heparin agarose 
(LTI) column. The column was washed with 10 column volumes using the 
loading buffer and eluted with a 1 0 column volume buffer gradient from 1 0 mM 
to 1 M KC1. Fractions containing polymerase activity were pooled and dialyzed 
in column buffer as above with the pH adjusted to 7.8. The dialyzed pool of 
fractions were loaded onto a MonoQ (Pharmacia) column. The column was 
washed and eluted as described above for the heparin column. The active 
fractions are pooled and a unit assay was performed. 

The unit assay reaction mixture contained 25 mM TAPS (pH 9.3), 2 mM 
MgCl 2 , 50 mM KC1, 1 mM DTT, 0.2 mM dNTPs, 500 ug/ml DNAse I treated 
salmon sperm DNA, 21 mCi/ml [«P 32 ] dCTP and various amounts of polymerase 
in a final volume of 50 ul. After 10 minutes incubation at 70°C, 10 ul of 0.5 M 
FDTA was added to the tube. TCA precipitable counts were measured in GF/C 
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Example 15: DNA Sequencing with the Mutant Polymerases 

M 1 3/pUC 23-base forward sequencing primer was 32 P-end-labeled for use 
in sequencing by incubating the following mixture at 37°C for 10 minutes: 60 
mM Tris-HCl (pH 7.8), 10 mM MgCl 2 , 200 mM KG. 0.2 uM primer, 0.4 uM (2 
uCi/ul) [y- 32 P] ATP, 0.2 U/ul T4 polynucleotide kinase. Labeling was terminated 
by incubating at 55 °C for 5 minutes. 

Four 10 ul base-specific sequencing reactions were set up for each test. 
The polymerase and the ddNTP concentrations were varied as follows: 



Test 


Tne DNA 
polymerase 


|ddATP| 


|ddCTP| 


|ddGTP] 


|ddTTP] 


1 


wild-type 


0.4 mM 


0.2 mM 


0.04 mM 


0.4 mM 


2 


TneFY 


0.4 mM 


0.2 mM 


0.04 mM 


0.4 mM 


3 


TneFY 


0.04 mM 


0.02 mM 


0.004 mM 


0.04 mM 


4 


TneFY 


0.004 mM 


0.002 mM 


0.0004 mM 


0.004 mM 


5 


Tne35FY 


0.4 mM 


0.2 mM 


0.04 mM 


0.4 mM 


6 


Tne35FY 


0.04 mM 


0.02 mM 


0.004 mM 


0.04 mM 


7 


Tne35FY 


0.004 mM 


0.002 mM 


0.0004 mM 


0.004 mM 


8 


Tne535FY 


0.4 mM 


0.2 mM 


0.04 mM 


0.4 mM 


9 


Tne535FY 


0.04 mM 


0.02 mM 


0.004 mM 


0.04 mM 


10 


Tne535FY 


0.004 mM 


0.002 mM 


0.0004 mM 


0.004 mM 



Other components of the reaction were held constant: 1.1 nM pUC 18 DNA, 22 
nM »p-end-labeled primer, 30 mM Tris-HCl (pH 9.0), 5 mM MgCl 2 , 50 mM 
KC1, 0.05% (w/v) W-l, 0.056 U/ul DNA polymerase (see table), 20 uM dATP, 
20 uM dCTP. 20 uM 7-deaza-dGTP. 20 uM dTTP. Samples were incubated in 

, . ,„ ocr for i minutes followed hv 20 cvcles of (30 seconds at 



Reactions 



(95% (v/v) formarmde, 10 mM EDTA (pH 8.0), 0.1% (w/v) bromophenol blue, 
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0.1% (w/v) xylene cyanol and denatured for two minutes at 70°C. Three ul 
aliquots were separated on a 6% TBE/urea sequencing gel. The dried gel was 
exposed to BioMAX-MR x-ray film for 16 hours. 



Results 



Cycle sequencing reactions using P 32 end-labeled primers were prepared 
using wild-type Tne DNA polymerase and each of the three mutants, TneFY, 
Tne35FY, and Tne535FY. All four of the polymerases produced sequencing 

_ i _ r\ i ~,, nn ^ir>r, lo^/-W when thp Tnn 

ladders, lhe met Y mutant gave omy a y ^ 4 u^n— 6 — , 

cycle sequencing reaction conditions were used. This is suggestive of premature 
termination due to efficient ddNTP incorporation. Diluting the 
dideoxynucleotides by a factor of 100 extended the ladder to about 200 bases. 
The F-Y mutation in the TneTY polymerase therefore allowed 
dideoxynucleotides to be incorporated at a much higher frequency than for wild- 
type polymerase. The Tne35FY mutant demonstrated a similar ability to 
incorporate dideoxynucleotides. In this case, the sequence extended to beyond 
400 bases and the excess P 32 end-labeled M13/pUC forward 23-Base sequencing 
primer band remained at the 23-base position in the ladder. The persistence of 
the 23-base primer band confirmed that the 3' - 5' exonuclease activity had been 
significantly reduced. The Tne535¥Y mutant performed similarly to the 
T«e35FY mutant except that the signal intensity increased by at least fivefold. 
The background was very low and relative band intensities were extremely even, 
showing no patterns of sequence-dependent intensity variation. 
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Example 16: Generation of 5' -3' exonuclease mutant of 
full length Tne DNA polymerase 

1. Identification of Two Amino Acids Responsible for 5'-3' Exonuclease 
Activity 

Tne DNA polymerase contains three enzymatic activities similar to E coli 
DNA polymerase 1: 5'-3' DNA polymerase activity, 3'-5' exonuclease activity 
and 5'-3' exonuclease activity. This example is directed to the elimination of the 
5 '-3' exonuclease activity in full length Tne DNA polymerase. Gutman and 
Minton (Nucleic Acids Res. 1993, 21, 4406-4407) identified six (A-F) conserved 
5'-3' exonuclease domains containing a total of 10 carboxylates in various DNA 
polymerases in the poll family. Seven out of 10 carboxylates (in domains A, D 
and E) have been implicated to be involved in divalent metal ions binding as 
judged from the crystal structure (Kim et al. Nature, 1995, 376, 612-616) of Taq 
DNA polymerase. However, there was no clear demonstration that these 
carboxylates are actually involved 5 '-3 'exonuclease activity. In order to find out 
the biochemical characteristics of some of these carboxylates, two of the aspartic 
acids in domains A and E were chosen for mutagenesis. The following aspartic 
acids in these two domains were identified: 

Tne DNA polymerase: 5 F L F D 8 G T 10 (domain A) 

Taq DNA polymerase: 15 L L V D 18 G H 20 



and 



Tne DNA polymerase: 



132 SL I TG D' 37 K D M L 141 (domain E) 
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2. Isolation of Single Stranded DNA for Mutagenesis 

Single stranded DNA was isolated from pSportTne (see infra). pSportTne 
was introduced into DH5aF'IQ (LTI, Gaithersburg, MD) by transformation. A 
single colony was grown in 2 ml Circle Grow (Bio 101, CA) medium with 
ampicillin at 37°C for 16 hrs. A 10 ml fresh media was inoculated with 0.1 ml 
of the culture and grown at 37°C until the A590 reached approximately 0.5. At 
that time, 0.1 ml of M13K07 helper phage (1X10" pfu/ml, LTI) was added to 
the culture. The infected culture was grown for 75 min. Kanamycin was then 
added at 50 ug/ml, and the culture was grown ovemigrn (16 las.). The culture 
was spun down. 9 ml of the supernatant was treated with 50 ug each of RNaseA 
and DNasel in the presence of 10 mM MgCl 2 for 30 min. at room temperature. 
To this mixture, 0.25 volume of a cocktail of 3M ammonium acetate plus 20% 
polyethylene glycol was added and incubated for 20 min. on ice to precipitate 
phage. The phage was recovered by centrifugation. The phage pellet was 
dissolved in 200 ul of TE (10 mM Tris-HCl (pH 8) and 1 mM EDTA). The 
phage solution was extracted twice with equal volume of buffer saturated phenol 
(LTI, Gaithersburg, MD). twice with equal volume of phenol :chloroform:isoamyl 
alcohol mixture (25:24:1. LTI. Gaithersburg, MD) and finally, twice with 
chloroform: isoamyl alcohol (24:1). To the aqueous layer, 0.1 volume of 7.5 M 
ammonium acetate and 2.5 volume of ethanol were added and incubated for 15 
min. at room temperature to precipitate single stranded DNA. The DNA was 
recovered by centrifugation and suspended in 200 ul TE. 

3. Mutagenesis ofD* and D' 37 

Two olicos were designed to mutagenize D 8 and D ,J7 to alanine. The 



TATGAAAATC 3' (SEQ ID NO:16) (D137A). In the D8A ohgo a NgoM v 
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(bold underlined) and in the oligo D137A a Kasl (bold underlined) site was 
created for easy identification of clones following mutagenesis. 200 pmol of each 
oligo was kinased according to the Muta-gene protocol (Bio-Rad, CA) using 5 
units of T4 Kinase (LTI, Gaithersburg, MD). 200 ng of single stranded DNA was 
annealed with 2 pmol of oligo according to the Muta-gene protocol. The reaction 
volume was 10 ul. Following the annealing step, complementary DNA synthesis 
and ligation was carried out using 5 units of wild-type T7 DNA polymerase 
(USB, Ohio) and 0.5 unit T4 ligase (LTI). 1 ul of the reaction was used to 
transform a MutS E coli (obtainable from Dr. Paul Modrich at the Duke 
University, NC) and selected in agax plates containing ampicilhn. A control 
annealing and synthesis reaction was carried out without addition of any oligo to 
determine the background. There were 50-60 fold more colonies in the 
transforation plates with the oligos than without any oligo. Six colonies from 
each mutagenic oligo directed synthesis were grown and checked for respective 
restriction site (/VgoAIV or Kasl). For D8 A (AfeoAIV), 4 out of 6 generated two 
fragments (3 kb and 4.1 kb). Since pSportTne has an Ngo AIY site near the fl 
intergenic region, the new NgoAlW site within the Tne DNA polymerase 
produced the expected fragments. The plasmid was designated as 
pSportTneNgoAIV. For D137A (Kasl), 5 out of 6 clones produced two expected 
fragments of 1 .1 kb and 6 kb in size. Since pSportTne has another Kasl site, the 
newly created Kasl site generated these two expected fragments. The plasmid 
was designated as pSportTneKasI. Both D8A and Dl 37A mutations have been 
confirmed by DNA sequencing. 

4. Reconstruction of the Mutant Polymerase into Expression Vector 

During, the course of expression of Tne DNA polymerase or mutant Tne 



similar to above mutagenesis technique glycine vva 
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acid in pSportTne. A mutation in the corresponding amino acid in E. coli DNA 
polymerasel (polA214. domain F) was found to have lost the 5'-3' exonuclease 
activity (Gutman and Minton, see above). An Ssp\ site was created in the mutant 
polymerase. Second, a 650 bp Sstl-Sphl fragment containing the G195D 
mutation was subcloned in P UCTne35FY (see infra) to replace the wild type 
fragment. This plasmid was called P UCTne3 022. Finally, the entire mutant Tne 
DNA polymerase was subcloned from P UCTne3022 into pTTQl 8 as Sstl-Hindlll 
fragment to generate pTTQTneSeqS 1 . To introduce the mutation D8A or Dl 37A 
in this expression vector, the 650 bp Sstl-Sphl was replaced with the same Sstl- 
Sphl fragment from pSport'TneNgoAIV or pSportTiicKasI. The plasmids were 
designated as pTTQTneNgo(D8A) and pTTQTneKas(D137A), respectively. 

5. Confirmation of the Mutations by DNA Sequencing 

DNA sequencing of both mutant polymerases confirmed the presence of 
the restriction site NgoAlV as well as the mutation D8A; and Kasl site as well 
as the mutation D137A. Also confirmed by DNA sequencing was the presence 
of the mutation D323A and the EcoAllll restriction site in the 3 '-5 'exonuclease 
region. In addition, confirmed by DNA sequencing was the F730Y mutation and 
the Hpal restriction site in the O-helix region of the mutant Tne DNA 
polymerase. 

6. 5' -3' exonuclease Activity of the Mutant Tne DNA Polymerases 

The full length mutant DNA polymerase was purified as described above. 
The 5 '-3 'exonuclease activity was determined as described in the LTI catalog. 
Briefly. 1 pmol of labeled ( 32 P) Haelll digested X DNA (LTI) was used for the 

6 and 10 units of either wild type or mutant 1 ne DNA polymerase m a j0 ui 
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reaction. The reaction mix was incubated for 1 hr at 72°C. A 10 ul aliquot was 
subjected to PEI-cellulose thin layer chromatography and the label released was 
quantised by liquid scintillation. In this assay, both D8A and D137A mutants 
showed less than 0.01% label release compared to the wild type Tne DNA 
polymerase. The result demonstrates that in both D8A and D137A mutants the 
5'-3'exonuclease activity has been considerably diminished. Thus, it has been 
confirmed for the first time that these two aspartates are involved with the 5 '-3' 
exonuclease activity. 

7. DNA Sequencing Characteristics of the Mutant DNA Polymerases 

Four separate base-specific reactions of the following composition were 
set up for each Tne polymerase mutant. 6.5 nM pUC 18, 1 1 1 nM M13/pUC 23 
base forward sequencing primer. 30 mM Tns-HCl (pH 9.0), 5 mM MgCl 2 , 10 
mM NaCl, 10 mM DTT, 0.05% (w/v) W-l, 0.00185 U/ul inorganic 
pyrophosphatase, 0.37 uG/ul (0.37 uM) [«-»S]dATP, 16-7 uM a-thio-dATP, 
16.7 uM dCTP, 16.7 uM 7-deaza-dGTP, 16.7 uM dTTP, and either 0.042 uM 
ddATP, 0.3 uM ddCTP, 0.255 uM ddGTP or 0.375 uM ddTTP. In these 
reactions, the concentrations of the various mutants were: 0.185 U/ul Tne535FY, 
or 0.170 U/ul D8A, or 0.185 U/ul D137A. Reaction volumes were 6 ul each. 
Sample tubes were incubated in an MJ Research DNA Engine thermal cycler at 
95°C for 3 minutes, followed by 20 cycles of (30 seconds at 95°C, 30 seconds 
at 55°C and 60 seconds at 70°C), and 10 cycles of (30 seconds at 95°C and 60 
seconds at 70°C). Reactions were terminated with 3 ul of stop solution (95% 
(v/v) formamide, 10 mM EDTA (pH 8.0), 0.1% (w/v) bromophenol blue, 0.1% 
(w/v) xylene cyanol) and denatured for two minutes at 70°C. Three ul aliquots 
were separated, on a 6% TBE/urea sequencing gel. The dried gel was exposed to 
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mutants of Tne DNA polymerase produced equivalent sequea 
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equal band intensity in all 4 lanes comparable to another 7>?e DNA polymerase 
where the 5'-exonuclease domain was deleted (Tne535FY). This result also 
suggests that both D8A and D137A mutants are devoid of 5'-exonuclease activity 
since no false bands are seen in the sequencing ladder, a characteristic of 5 '-3' 
5 exonuclease containing DNA polymerase. 

Example 17: Advantages ofTneDNA Polymerase Mutant in 
Sequencing Reactions 

In this exampie. the Tne DNA polymerase of Example 12 was used which 
has the Phe 730 -+Tyr 730 mutation (making it non-discriminatory for dNTPs over 
10 ddNTPs), the Asp 323 -Ala 323 mutation (which substantially reduces 3'-to-5' 

exonuclease activity), and the N-terminal 219 amino acid deletion mutation 
(which eliminates 5'-to-3' exonuclease activity). 

Sequenase Ver 2.0™ is a modified T7 DNA polymerase sold by 
Amersham International pic, Little Chalfont, England. 
!5 Taq DNA polymerase was purchase from LTI, Gaithersburg, MD. 

Thermo Sequenase™ is a Taq F^Y mutant containing a 5 '-exonuclease 
deletion sold by Amersham International pic. Little Chalfont, England. 

AmpliTaq FS™ is a Taq F^Y mutant believed to contain a Gly 37 
mutation sold by Perkin Elmer ABI, Foster City, CA. 
20 Sequitherm™ is a thermophilic DNA polymerase sold by Epicenter, 

Madison, WI. 

Methods 

35 S cycle Sequencing with Tne DNA Polymerase 

25 up for each template: 6.5 nM dsDN A. lii nM primer, 30 inM Tiib-HCl (pliy.0). 
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5 mM MgCl 2; 10 mM NaCl, 10 mM DTT, 0.05% (w/v) W-L, 0.185 U/jiL Tne 
DNA polymerase mutant, 0.001 85 U/ul thermophilic inorganic pyrophosphatase, 
0.37 uCi/ul (0.37 ,iM) [a- 35 S]dATP, 16.7 a-thio-dATP, 16.7 uM dCTP, 16.7 
^M 7-deaza-dGTP, 16.7 uM dTTP, and either 0.042 ddATP, 0.3 uM ddCTP, 
0.255 uM ddGTP or 0.375 \xM ddTTP. Reaction volumes are 6 ul each. Sample 
tubes are incubated in an MJ Research DNA Engine thermal cycler at 95°C for 
3 minutes, followed by 20 cycles of (30 seconds at 95°C, 30 seconds at 55°C and 
60 seconds at 70°C), and 10 cycles of (30 seconds at 95°C and 60 seconds at 
70°C). Reactions are terminated with 3 ^1 of stop solution (95% (v/v) 

.. . „ r-^-r . , .ti o m A 10/ /„„\,\ Kmrnnnhprifll blue. 0.1% (wM 

tormamide. iu mm cui/\ yjn o.u,. u.i ,» r 

xylene cyanol) and denatured for 2 minutes at 70°C. Three microliter aliquots are 
separated on a 6% TBE/urea sequencing gel. The dried gel is exposed to Kodak 
BioMAX x-ray film at room temperature for approximately 18 hours, unless 
otherwise specified. 

12 P-end Labeled Primer Cycle Sequencing with Tne DNA Polymerase 

The sequencing primer is labeled by incubating the following 5 yx\ 
reaction for 10 minutes at 37°C: 60 mM Tris-HCl. 10 mM MgCl 2 , 200 mM KCL 
0.6 uM primer. 0.4 uM (2 nCi/nD [ Y - 32 P]ATP, 0.2 U/nl T4 polynucleotide 
kinase. The reaction is stopped by incubating 5 minutes at 55 °C. Four separate 
base-specific reactions of the following composition are then set up for each 
template: 1.1 nM dsDNA, 67 nM 32 P-end-labeled primer, 30 mM Tris-HCl (pH 
9.0), 5 mM MgCK. 50 mM KC1, 0.05% (w/v) W-l, 0.185 U/ul Tne DNA 
polymerase, 0.00185 U/ul thermophilic inorganic pyrophosphatase, 20 uM 
dATP, 20 uM dCTP, 20 ^M 7-deaza-dGTP, 20 ^M dTTP, and either 0.4 uM 
ddATP, 0.4 uM ddCTP, 0.4 uM ddGTP or 0.4 jiM ddTTP. Reaction volumes are 



30 seconds at 5i"L and 6u seconas at /u ' 
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95 °C and 60 seconds at 70 °C). Reactions are terminated with 5 ul of stop 
solution (95% (v/v) formamide, 10 mM EDTA (pH 8.0), 0.1% (w/v) 
bromophenol blue. 0.1% (w/v) xylene cyanol) and denatured for 2 minutes at 
70°C. Three ul aliquots are separated on a 6% TBE/urea sequencing gel. The 
dried gel is exposed to Kodak BioMAX x-ray film at room temperature for 
approximately 18 hours, unless otherwise specified. 

Single-extension Sequencing with Tne DNA Polymerase 

~. . • rvKTA „.j„„o.,,ro/lHcnMA TVipTTNJAi<= 

l nis reaction requncs ciuici ssui^n. ^ uw.uiu.w — - 

annealed to primer in a 10 ul volume by heating for five minutes at 50°C under 
the following reaction conditions: 1 50 nM dsDNA and 1 50 nM primer or 50 nM 
ssDNA and 50 nM primer with 60 mM Tris-HCl (pH 9.0), 60 mM KC 1 , 1 0 mM 
MgCl 2 , 0.1% (w/v) W-l . The following labeling reaction is then incubated for 
five minutes at 50°C in a 15.5 ul volume: lOul annealed DNA-primer 0.32 uCi/ul 
(0.32 uM) [a- 35 S]dATP, 48.4 mM Tris HC1 (pH 9.0), 48.4 mM KC1, 8.1 mM 
MgCl 2 , 194 nM dCTP, 194 nM 7-deaza-dGTP. 194 nM dTTP, 6.5 nM DTT, 
0.081% (w/v) W-l. 0.32 U/ul Tne DNA polymerase, 0.0032 U/ul thermophilic 
inorganic pyrophosphatase. The label mixture is then dispensed into four base- 
specific reaction tubes. Each tube contains a total reaction volume of 6 ul and is 
incubated for 5 minutes at 70°C under the following conditions: DNA-labeled 
primer 0.19 uCi/ul (0.19 uM) [a- 35 S]dATP, 28 mM Tris-HCl (pH 9.0), 28 mM 
KC1, 4.7 mM MgCl 2 , 42 uM dATP, 42 uM dCTP, 42 uM 7-deaza-dGTP, 42 uM 
dTTP, 3.8 mM DTT, 0.047% (w/v) W-l, 0.19 U/ul Tne DNA polymerase, 0.0019 
U/ul thermophilic inorganic pyrophosphatase and either 0.83 uM ddATP, 0.83 
uM ddCTP, 0.83 uM ddGTP or 0.83 \iU ddTTP. Reactions are terminated by 
adding 4ul of stop solution (95% (v/v) formamide. 10 mM EDTA (pH 8.0), 0.1% 
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dried gel is exposed to Kodak BioMAX x-ray film at room temperature for 
approximately 2 hours, unless otherwise specified. 

Fluorescent Dye Primer Sequencing with Tne DNA Polymerase 

Four base-specific reactions are set up for each template. The A and C 
reaction volumes are 5 ul and the G and T reaction volumes are 10 ul. The 
composition of the reactions are as follows: 20 nM dsDNA or 10 nM ssDNA, 
with 30 mM Tris-HCl (pH 9.0), 30 mM KC1, 5 mM MgCl 2 , 0.05% (w/v) W-l, 

™ * , , . ™ x , J^i-r. on ..\a n ACVTV On „M HTTP 0 ~>Q 1 T/til TrtP 

JIM UA l r , H1V1 I I , /.U H1V1 / "Ul,UiU-uu w , — | i - r 

DNA polymerase, 0.0029 U/ul thermophilic inorganic pyrophosphatase. Each 
of the four tubes also contains a base-specific dye primer and ddNTP as follows: 

A: 0.4 uM JOE dye primer, 0.4 uM ddATP 

C: 0.4 uM FAM dye primer, 0.4 uM ddCTP 

G: 0.4 uM TAMRA dye primer, 0.4 uM ddGTP 

T: 0.4 uM ROX dye primer, 0.4 uM ddTTP 
Sample tubes are incubated in a thermal cycler at 95°C for 3 minutes, followed 
by 20 cycles of (30 seconds at 95°C, 30 seconds at 55°C and 60 seconds at 70°C), 
and 10 cycles of (30 seconds at 95°C and 60 seconds at 70°C). Reactions are 
pooled, purified over a CentriSep spin column, and dried. The dried pellet is 
dissolved in 3 ul of 83% formamide, 4.2 mM EDTA (pH 8.0) and heated for 2 
minutes at 90°C just before loading the entire sample on a 4.75% 
polyacrylamideATBE/urea gel in an ABI 373 Stretch machine. The gel is run at 
32 watts for 14 hours. 

Fluorescent Dye Terminator Sequencing with Tne DNA Polymerase 



primer, 3U mM Ins-HCi {pn v.u), ju mm ro^i 
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1, 0.6 mM dATP, 0.6 mM dCTP, 1.8 raM dITP, 0.6 mM dTTP, 0.5 U/ml Tne 
DNA polymerase, 0.005 U/ul thermophilic inorganic pyrophosphatase. The 
reaction also includes four base-specific dye terminators at a final concentration 
16-fold lower than the original concentration supplied by ABI. The sample tube 
is incubated in a thermal cycler for 25 cycles of (30 seconds at 96°C, 1 5 seconds 
at 50°C and 4 minutes at 60°C). The reaction is purified over a CentriSep spin 
column, and dried. The dried pellet is dissolved in 3 ul of 83% formamide, 4.2 
mM EDTA (pH 8.0) and heated for 2 minutes at 90°C just before loading the 
entire sample on a 4.75% polyacrylamide/TBE/urea gel in an ABI 373 Stretch 
machine. The gel is run at 32 watts for 14 hours. 



Single-extension Sequencing 

FIG. 9 shows that the efficient 35 S incorporation by Tne DNA polymerase 
mutant provides strong signals in single- and double-strand DNA sequencing. 
Alkali-denatured pUC19 DNA (1.5 pmol) was sequenced using single-extension 
sequencing with Tne DNA polymerase of Example 12 as described above (set A); 
film was exposed for only 2 hours. M 1 3 mp 1 9(+) DNA was used at one-tenth the 
normal amount of template (40 pmol) in the Tne DNA polymerase single- 
extension sequencing reactions as described (set B); film exposed for 20 hours. 
Since the Tne mutant produces such a strong signal, templates can be used more 
economically without sacrificing sequence quality. 

FIG. 10 shows that the Tne DNA polymerase mutant generates clear 
sequence from plasmids containing cDNAs with poly(dA) tails. Alkali-denatured 
plasmid DNAs containing cDNA inserts (1.5 pmol) were sequenced using either 
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Set A, P-actin cDNA; set B, RPA1 cDNA (a replication protein); and set C, 
RPA2 cDNA (a replication protein). 

FIG. 1 1 compares the Tne DNA polymerase mutant. Sequenase™ and Taq 
DNA polymerase generated sequences from a plasmid containing poly(dC). 
5 Plasmid DNA (1.5 pmol) containing a poly(dC)-tailed 5' RACE-derived insert 

was alkali denatured. The DNA was sequenced using Tne DNA polymerase 
mutant in single-extension sequencing (set A) as described, Sequenase Ver 2.0™ 
(set B) as described in the kit manual, and by Taq DNA polymerase (set C) 
following the recommended protocol in the TaqTrack kit (Promega, Madison, 
10 WI). 

Cycle Sequencing 

FIG. 12 shows that the Tne DNA polymerase mutant in cycle sequencing 
produces 35 S-labeled sequence 3-fold stronger than Thermo Sequenase™ and 
without the 60-cycle labeling step. Plasmid DNA (0.5 ug) containing a poly(dC)- 

15 tailed 5' RACE-derived insert was cycled sequenced using Tne DNA polymerase 

mutant (set A) as described; film exposure was 6 hours. Using Thermo 
Sequenase™ as described in the kit manual, the plasmid DNA (0.5 ug) was 
labeled with 35 S by partial primer extension using an incubation of 60 cycles. 
This was followed by the standard cycle sequencing protocol in the presence of 

20 chain terminators (set B); film exposure was 18 hours. The plasmid DNA (0.5 

ug) was cycle sequenced using Taq DNA polymerase (set C) as described in the 
fmol kit manual; film exposure was 18 hours. Note, uneven band intensities in 
setC. 

FIG. 13 shows that the Tne DNA polymerase mutant produces high 
25 quality sequences of in vitro amplified DNA. Templates were in vitro amplified 



DNAs (100 fmoi) were cycie sequenced as described using the 1 nc DNA 
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polymerase mutant and one of the amplification primers. Set A, E. coli (3 poll 
(~450bp); set B, E. coli rrsE (-350 bp); set C, ori from pSClOl (-1.5 kb); and set 
D, an exon from human HSINF gene (-750 bp); amplified product sizes in 
parentheses. Note, these DNAs could not be sequenced using Thermo 
5 Sequenase™ because the primers did not meet the extra requirements for the 

labeling reaction. 

FIGS. 14A and 14B show that the Tne DNA polymerase mutant provides 
superior sequence from double-stranded DNA clones containing poly(dA) or 
poly(dC) stretches. Fig. 14 A, supercoiled plasmid DNAs containing inserts with 



described; film exposure was 6 hours. Set A, RPA1; set B, elf (cap binding 

protein); and set C, a poly(dC)-tailed 5' RACE-derived insert. 

FIG. 14B, supercoiled plasmid DNAs containing inserts with 

homopolymers were cycled sequenced using Taq DNA polymerase (set D) in the 
15 fmol kit manual, or SequiTherm™ (sets E-G) following the kit manual; film 

exposure was 18 hours. Set D, RPA; set E, RPA; set F, a poly(dC)-tailed 5' 

RACE-derived insert: and set G. elf. Note, the many false stops, especially in the 

homopolymer region. 

FIG. 15 shows cycle sequencing using the Tne DNA polymerase mutant 
20 and 32 P end-labeled primer. A sequencing primer was first 5 '-end labeled with 

32 P using T4 kinase. A supercoiled plasmid DNA (50 fmol) was cycle sequenced 

using the Tne DNA polymerase mutant as described; film exposure was 1 8 hours. 

The left and right sets are aliquots of the same reaction, the right set loaded on the 

gel 45 minutes after the left. 

25 Fluorescent A utomated Sequencing 
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City, CA) using either the Tne DNA polymerase mutant or AmpliTaq FS™ as 
described. 

FIGS. 17A-17C and 17D-17F show a comparison of the Tne DNA 
polymerase mutant (17A-17C) to AmpliTaq FS™ (17D-17F) in fluorescent dye 
terminator sequencing. pUC19 DNA was sequenced with dye terminators (ABI, 
Foster City, CA) using either the Tne DNA polymerase mutant or AmpliTaq 
FS™ as described. Note, greater evenness of peak heights with Tne. 

These results demonstrate that the Tne DNA polymerase mutant gives 
unexpectedly better results in DNA sequencing compared to other DNA 
polymerases, whether they are similar mutants or not. 

From the foregoing description, one skilled in the art can easily ascertain 
the essential characteristics of this invention, and without departing from the 
spirit and scope thereof, can make various changes and modifications of the 
invention to adapt it to various usages and conditions without undue 
experimentation. All patents, patent applications and publications cited herein 
are incorporated by reference in their entirety. 



WO 97/09451 



PCT/US96/14189 



-62- 

SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Life Technologies, Inc. 

8717 Groveraont Circle 
Gaithersburg, MD 20884-9980 
APPLICANTS / INVENTORS : Chatterjee, Deb K. 

Hughes , Jr . , A . John 

(ii) TITLE OF INVENTION : Cloned DNA Polymerases from Thermotoga 
and Mutants Thereof 



(iii) NUMBER OF SEQUENCES : 22 



(iv) CORRESPONDENCE ADDKbijS : 

(A) ADDRESSEE: STERNE, KESSLER, GOLDSTEIN & FOX P.L.L.C. 

(B) STREET: 1100 New York Ave., N.W., Suite 600 

(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20005 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE : Patentln Release #1.0, Versn 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To be assigned 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/525,057 

(B) FILING DATE: 08-SEP-1995 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/537,397 

(B) FILING DATE: 02 -OCT- 1995 

(C) CLASSIFICATION: 
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(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/576,759 

(B) FILING DATE: 21-DEC-1995 

(C) CLASSIFICATION : 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: To be assigned 

(B) FILING DATE: 14-AUG-1996 

(C) CLASSIFICATION: 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Esmond, Robert W. 

(B) REGISTRATION NUMBER: 32,893 

(C) REFERENCE /DOCKET NUMBER: 0942.280PC03 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-371-2600 

(B) TELEFAX: 202-371-2540 



(2) INFORMATION FOR SEQ ID NO : 1 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
GAGCTCACGG GGGATGCAGG AAA 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
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ATGGCGAGAC TATTTCTCTT TGATGGCACA GCCCTGGCCT ACAGGG CAT A TTACGCCCTC 6 0 

GACAGATCCC TTTCCACATC CACAGGAATT CCAACGAACG CCGTCTATGG CGTTGCCAGG 12 0 

ATGCTCGTTA AATTCATTAA GGAACACATT AT AC C CG AAA AGGACTACGC GGCTGTGGCC 18 0 

TTCGACAAGA AGGCAGCGAC GTTCAGACAC AAACTGCTCG TAAGCGACAA GGCGCAAAGG 24 0 

CCAAAGACTC CGGCTCTTCT AGTTCAGCAG CTACCTTACA TCAAGCGGCT GATAGAAGCT 3 00 

CTTGGTTTCA AAGTGCTGGA GCTGGAGGGA TACGAAGCAG ACGATATCAT CGCCACGCTT 3 60 

GCAGTCAGGG CTGCACGTTT TTTGATGAGA TTTTCATTAA TAACCGGTGA CAAGGATATG 42 0 

CTTCAACTTG TAAACGAGAA GATAAAGGTC TGGAGAATCG TCAAGGGGAT ATCGGATCTT 4 80 

GATCTTCTAG CACTGACGGG AGACGACATA GACAACATTC CCGGTGTAAC GGGAATAGGT 600 

GAAAAGACCG CTGTAC AG CT TCTCGGCAAG TATAGAAATC TTGAATACAT TCTGGAGCAT 66 0 

GCCCGTGAAC TCCCCCAGAG AGTGAGAAAG GCTCTCTTGA GAGACAGGGA AGTTGCCATC 72 0 

CTCAGTAAAA AACTTGCAAC TCTGGTGACG AACGCACCTG TTGAAGTGGA CTGGGAAGAG 78 0 

AT G AAAT AC A GAGGATACGA CAAGAGAAAA CTACTTCCGA TATTGAAAGA ACTGGAGTTT 84 0 

GCTTCCATCA TGAAGGAACT TCAACTGTAC GAAGAAGCAG AACCCACCGG AT AC G AAAT C 900 

GTGAAGGATC ATAAGAC CTT CGAAGATCTC ATCGAAAAGC TGAAGGAGGT TCCATCTTTT 96 0 

GCCCTGGACC TTGAAACGTC CTCCTTGGAC CCGTTCAACT GTGAGATAGT CGGCATCTCC 102 0 

GTGTCGTTCA AACCGAAAAC AGCTTATTAC ATTCCACTTC ATCACAGAAA CGCCCACAAT 108 0 

CTTGATGAAA CACTGGTGCT GTCGAAGTTG AAAGAGATCC TCGAAGACCC GTCTTCGAAG 114 0 

ATTGTGGGTC AGAACCTGAA GTACGACTAC AAGG TTCTTA TGGTAAAGGG TATATCGCCA 12 00 

GTTTATCCGC ATTTTGACAC GATGATAGCT GC AT ATTTGC TGGAGCCAAA CGAGAAAAAA 126 0 

TTCAATCTCG AAGATCTGTC TTTGAAATTT CTCGGATACA AAATGACGTC TTATCAGGAA 132 0 

CTGATGTCGT TTTCCTCACC ACTTTTTGGT TTCAGCTTTG CGGATGTTCC GGTAGACAAG 13 8 0 

GCTGCCGAAT ACTCCTGCGA GGATGCAGAC ATCACTTATA GGCTCTACAA GATACTCAGC 144 0 
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GGTGAGCCCT TCAACATCAA TTCTCCAAAA CAGGTTTCAA ACATCCTTTT TGAGAAG CTG 168 0 

GGAATAAAAC CCCGTGGAAA AACGACAAAA ACAGGAGATT ACTCTACCAG GATAGAGGTG 174 0 

TTGGAAGAGA TAGCGAATGA GCACGAGATA GTACCCCTCA TTCTCGAGTT CAGAAAGATC 18 00 

CTGAAACTGA AATCGACCTA CATAGACACC CTTCCGAAAC TTGTGAACCC GAAAACCGGA 186 0 

AGATTTCATG CATCTTTCCA CCAGACGGGT ACCGCCACTG GCAGGTTGAG TAGCAGTGAT 192 0 

CCAAATCTTC AGAATCTTCC GACAAAGAGC GAAGAGGGAA AAGAAATTAG AAAAGCGATT 19 80 

GTGCCCCAGG ATCCAGACTG GTGGATCGTC AGTGCGGATT ATTCCCAAAT AGAACTCAGA 2 04 0 

ATCCTCGCTC ATCTCAGTGG TGATGAGAAC CTTGTGAAGG CCTTCGAGGA GGGCATCGAT 2100 

GTGCACACCT TGACTGCCTC CAGGATCTAC AACGTAAAGC f AG AAGAAGT GAACGAAGAA 216 0 

ATGCGACGGG TTGGAAAGAT GGTGAACTTC TCTATAATAT ACGGTGTCAC ACCGTACGGT 22 2 0 

CTTTCTGTGA GACTTGGAAT ACCGGTTAAA G AAG C AG AAA AGATGATTAT CAGCTATTTC 22 8 0 

ACACTGTATC CAAAGGTGCG AAGCTACATC CAGCAGGTTG TTGCAGAGGC AAAAGAGAAG 2 34 0 
GGCTACGT C A GGACTCTCTT TGGAAGAAAA AGAGATATTC CCCAGCTCAT GGCAAGGGAC 
AAGAACACCC AGTCCGAAGG CGAAAGAATC GCGATAAACA CCCCCATTCA GGGAACTGCG 

G C AG AT AT AA TAAAATTGGC TATGATAGAT ATAGACGAGG AG CTG AGAAA AAGAAACATG 2 52 0 

AAATCCAGAA TGATCATTCA GGTTCATGAC GAACTGGTCT TCGAGGTTCC CGATGAGGAA 2 58 0 

AAAGAAGAAC TAGTTGATCT GGTGAAGAAC AAAATGACAA ATGTGGTGAA ACTCTCTGTG 2 64 0 

CCTCTTGAGG TTGACATAAG CATCGGAAAA AGCTGGTCTT GA 2 6 82 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



2400 
2460 
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Tyr Tyr Ala Leu Asp 

20 

Asn Ala Val Tyr Gly 
35 

His lie lie Pro Glu 
50 

Ala Ala Thr Phe Arg 
65 

Pro Lys Thr Pro Ala 
85 

Leu lie Glu Ala Leu 



Ala Asp Asp lie lie 
115 

Met Arg Phe Ser Leu 
130 

Asn Glu Lys lie Lys 
145 

Glu Leu Tyr Asp Ser 
165 

His Gin lie Pro Asp 
180 

lie Pro Gly Val Thr 
195 

Gly Lys Tyr Arg Asn 
210 

Pro Gin Arg Val Arg 
225 

Leu Ser Lys Lys Leu 
245 



-66- 



Arg Ser Leu Ser Thr Ser 
25 

Val Ala Arg Met Leu Val 
40 

Lys Asp Tyr Ala Ala Val 
55 

His Lys Leu Leu Val Ser 
70 75 

Leu Leu Val Gin Gin Leu 
90 

Gly Phe Lys Val Leu Glu 
105 

Ala Thr Leu Ala Val Arg 
120 

lie Thr Gly Asp Lys Asp 
135 

Val Trp Arg lie Val Lys 
150 155 

Lys Lys Val Lys Glu Arg 
170 

Leu Leu Ala Leu Thr Gly 
185 

Gly He Gly Glu Lys Thr 
200 

Leu Glu Tyr He Leu Glu 
215 

Lys Ala Leu Leu Arg Asp 
230 235 

Ala Thr Leu Val Thr Asn 
250 



Thr Gly He Pro Thr 

30 

Lys Phe He Lys Glu 
45 

Ala Phe Asp Lys Lys 
60 

Asp Lys Ala Gin Arg 
80 

Pro Tyr He Lys Arg 
95 

Leu Glu Gly Tyr Glu 
110 

Ala Ala Arg Phe Leu 
125 

Met Leu Gin Leu Val 
140 

Gly He Ser Asp Leu 
160 

Tyr Gly Val Glu Pro 
175 

Asp Asp He Asp Asn 
190 

Ala Val Gin Leu Leu 
205 

His Ala Arg Glu Leu 

220 

Arg Glu Val Ala He 

240 

Ala Pro Val Glu Val 
255 



Asp Trp Glu Glu Met Lys Tyr Arg Gly Tyr Asp Lys Arg Lys Leu Leu 
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Lys Thr Phe Glu Asp Leu lie Glu Lys Leu Lys Glu Val Pro Ser Phe 

305 310 315 320 

Ala Leu Asp Leu Glu Thr Ser Ser Leu Asp Pro Phe Asn Cys Glu He 

325 330 335 

Val Gly He Ser Val Ser Phe Lys Pro Lys Thr Ala Tyr Tyr He Pro 

340 345 350 

Leu His His Arg Asn Ala His Asn Leu Asp Glu Thr Leu Val Leu Ser 

355 360 365 

Lys Leu Lys Glu He Leu Glu Asp Pro Ser Ser Lys He Val Gly Gin 

370 375 380 

Asn Leu Lys Tyr Asp Tyr Lys Val Leu Met Val Lys Gly He Ser Pro 

385 390 395 400 

Val Tyr Pro His Phe Asp Thr Met He Ala Ala Tyr Leu Leu Glu Pro 



Asn Glu Lys Lys Phe Asn Leu Glu Asp Leu Ser Leu Lys Phe Leu Gly 
420 425 430 

Tyr Lys Met Thr Ser Tyr Gin Glu Leu Met Ser Phe Ser Ser Pro Leu 
435 440 445 

Phe Gly Phe Ser Phe Ala Asp Val Pro Val Asp Lys Ala Ala Glu Tyr 
450 455 460 

Ser Cys Glu Asp Ala Asp lie Thr Tyr Arg Leu Tyr Lys He Leu Ser 
465 470 475 480 

Met Lys Leu His Glu Ala Glu Leu Glu Asn Val Phe Tyr Arg He Glu 
485 490 495 

Met Pro Leu Val Asn Val Leu Ala Arg Met Glu Phe Asn Trp Val Tyr 
500 505 510 

Val Asp Thr Glu Phe Leu Lys Lys Leu Ser Glu Glu Tyr Gly Lys Lys 
515 520 525 

Leu Glu Glu Leu Ala Glu Lys He Tyr Gin He Ala Gly Glu Pro Phe 
530 535 540 



Asn He Asn Ser Pro Lys Gin Val Ser Asn He Leu Phe Glu Lys Leu 
545 550 555 560 
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Leu lie Leu Glu Phe Arg Lys lie Leu Lys Leu Lys Ser Thr Tyr He 
595 600 605 

Asp Thr Leu Pro Lys Leu Val Asn Pro Lys Thr Gly Arg Phe His Ala 
610 615 620 

Ser Phe His Gin Thr Gly Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp 
625 630 635 640 

Pro Asn Leu Gin Asn Leu Pro Thr Lys Ser Glu Glu Gly Lys Glu He 
645 650 655 

Arg Lys Ala He Val Pro Gin Asp Pro Asp Trp Trp He Val Ser Ala 
660 665 670 

Asp Tyr Ser Gin He Glu Leu Arg He Leu Ala His Leu Ser Gly Asp 
675 6B0 SS5 

Glu Asn Leu Val Lys Ala Phe Glu Glu Gly He Asp Val His Thr Leu 
690 695 700 

Thr Ala Ser Arg He Tyr Asn Val Lys Pro Glu Glu Val Asn Glu Glu 
705 710 715 720 

Met Arg Arg Val Gly Lys Met Val Asn Phe Ser He He Tyr Gly Val 
725 730 735 

Thr Pro Tyr Gly Leu Ser Val Arg Leu Gly He Pro Val Lys Glu Ala 
740 745 750 

Glu Lys Met He He Ser Tyr Phe Thr Leu Tyr Pro Lys Val Arg Ser 
755 760 765 

Tyr He Gin Gin Val Val Ala Glu Ala Lys Glu Lys Gly Tyr Val Arg 



Thr Leu Phe Gly Arg Lys Arg Asp He Pro Gin Leu Met Ala Arg Asp 

785 790 795 800 

Lys Asn Thr Gin Ser Glu Gly Glu Arg He Ala He Asn Thr Pro He 

805 810 815 

Gin Gly Thr Ala Ala Asp He He Lys Leu Ala Met He Asp He Asp 
820 825 830 

Glu Glu Leu Arg Lys Arg Asn Met Lys Ser Arg Met He He Gin Val 
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Pro Leu Glu Val Asp lie Ser lie Gly Lys Ser Trp Ser 
885 890 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 4 : 

Pro Ser Phe Ala Leu Asp Leu Glu Thr Ser Ser 
1 5 10 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Pro Val Phe Ala Phe Asp Thr Glu Thr Asp Ser 
15 10 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 
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Gly Pro Val Ala Phe Asp Ser Glu Thr Ser Ala 



(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(XX) SEQUiiNUfc- INSCRIPTION : SEQ ID NO : 7 : 
Met lie Val Ser Asp lie Glu Ala Asn Ala 

a 5 io 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GACGTTTCAA GCGCTAGGGC AAAAGA 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



Lys Met Val Asn Phe Ser lie He Tyr Gly 
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1 5 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Lys Ala He Asn Phe Gly Leu He Tyr Gly 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 11 : 
Lys Ala He Thr Phe Gly He Leu Tyr Gly 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



Lys Thr Phe He Tyr Gly Phe Leu Tyr Gly 
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15 10 
(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Lys Thr lie Asn Phe Gly Val Leu Tyr Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
GTATATTATA GAGTAGTTAA CCATCTTTCC A 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



GTAGGCCAGG GGCTGTGCCG GCAAAGAGAA ATAGTC 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GAAGGATATC CTTGGCGCCG GTTATTATGA AAATC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATGGCGAGAC TATTTCTCTT TGATGGCACA GCCCTGGCCT ACAGGGCATA TTACGCCCTC 6 0 

GACAGATCCC TTTCCACATC CACAGGAATT CCAACGAACG CCGTCTATGG CGTTGCCAGG 12 0 
ATGCTCGTTA AATTCATTAA GGAACACATT ATACC CGAAA AGGACTACGC GGCTGTGGCC 
TTCGACAAGA AGGCAGCGAC GTTCAGACAC AAACTGCTCG TAAGCGACAA GGCGCAAAGG 
CCAAAGACGC CGGCTCTTCT AGTTCAGCAG CTACCTTACA TCAAGCGGCT GATAGAAGCT 
CTTGGTTTCA AAGTGCTGGA GCTGGAAGGG TACGAAGCAG ACGATATCAT CGCCACGCTT 

G C AG CAAAGG GCTGCACGTT TTTTGATGAG ATTTTCATAA TAACCGGTGA CAAGGATATG 42 0 

CTTCAACTTG TAAACGAGAA GATAAAGGTC TGGAGAATCG T CAAGGGG AT ATCGGATCTT 48 0 

GAGCTTTACG ATTCGAAAAA GGTGAAAGAA AG AT AC GGTG TGGAACCACA TCAGATACCG 54 0 



180 
240 
300 
360 



GCCCGTGAAC TCCCCCAGAG AGTGAGAAAG GCTCTCTTGA GAGACAGGGA AGTTGCCATC 
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CTCAGTAAAA AACTTG CAAC TCTGGTGACG AACGCACCTG TTGAAGTGGA CTGGGAAGAG 
ATGAAATACA GAGGATACGA CAAGAGAAAA CTACTTCCGA TATTGAAAGA ACTGGAGTTT 
GCTTC CATCA TGAAGGAACT TCAACTGTAC GAAGAAGCAG AACCCACCGG AT AC G AAAT C 
GTGAAGGATC ATAAGACCTT CGAAGATCTC ATCGAAAAGC TGAAGGAGGT TCCATCTTTT 
GCCCTGGACC TTGAAACGTC CTCCTTGGAC CCGTTCAACT GTGAGATAGT CGGCATCTCC 
GTGTCGTTCA AACCGAAAAC AG CTT ATT AC ATTCCACTTC ATCACAGAAA CGCCCACAAT 
CTTGATGAAA CACTGGTGCT GTCGAAGTTG AAAGAGATCC TCGAAGACCC GTCTTCGAAG 
ATTGTGGGT C AGAACCTGAA GTACGACTAC AAGGTTCTTA TGGTAAAGGG TATATCGCCA 
GTTTATCCGC ATTTTGACAC GATGATAGCT GCATATTTGC TGGAGCCAAA CGAGAAAAAA 
TTCAATCTCG AAGATCTGTC TTTGAAATTT CTCGGATACA AAATGACGTC 



780 
840 
900 
960 
1020 
1080 
1140 
1200 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43S amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l8: 

Met Ala Arg Leu Phe Leu Phe Asp Gly Thr Ala Leu Ala Tyr Arg Ala 
1 5 10 15 

Tyr Tyr Ala Leu Asp Arg Ser Leu Ser Thr Ser Thr Gly He Pro Thr 
20 25 30 

Asn Ala Val Tyr Gly Val Ala Arg Met Leu Val Lys Phe He Lys Glu 
35 40 45 

His He He Pro Glu Lys Asp Tyr Ala Ala Val Ala Phe Asp Lys Lys 
50 55 60 

Ala Ala Thr Phe Arg His Lys Leu Leu Val Ser Asp Lys Ala Gin Arg 



Leu He Glu Ala Leu Gly Phe Lys 
100 



Val Leu Gli 



Leu Glu Gly 
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Ala Asp Asp He He Ala Thr Leu Ala Ala Lys Gly Cys Thr Phe Phe 
115 120 125 

Asp Glu He Phe He He Thr Gly Asp Lys Asp Met Leu Gin Leu Val 
130 135 140 

Asn Glu Lys He Lys Val Trp Arg He Val Lys Gly He Ser Asp Leu 
145 150 155 160 

Glu Leu Tyr Asp Ser Lys Lys Val Lys Glu Arg Tyr Gly Val Glu Pro 
165 170 175 

His Gin He Pro Asp Leu Leu Ala Leu Thr Gly Asp Asp He Asp Asn 
180 185 190 

He Pro Gly Val Thr Gly He Gly Glu Lys Thr Ala Val Gin Leu Leu 
195 ^UU 20S 

Gly Lys Tyr Arg Asn Leu Glu Tyr He Leu Glu His Ala Arg Glu Leu 
210 215 220 

Pro Gin Arg Val Arg Lys Ala Leu Leu Arg Asp Arg Glu Val Ala He 
225 230 235 240 

Leu Ser Lys Lys Leu Ala Thr Leu Val Thr Asn Ala Pro val Glu Val 
245 250 255 

Asp Trp Glu Glu Met Lys Tyr Arg Gly Tyr Asp Lys Arg Lys Leu Leu 
260 265 270 

Pro He Leu Lys Glu Leu Glu Phe Ala Ser He Met Lys Glu Leu Gin 
275 280 285 

Leu Tyr Glu Glu Ala Glu Pro Thr Gly Tyr Glu He Val Lys Asp His 
290 295 300 

Lys Thr Phe Glu Asp Leu He Glu Lys Leu Lys Glu Val Pro Ser Phe 
305 310 315 320 

Ala Leu Asp Leu Glu Thr Ser Ser Leu Asp Pro Phe Asn Cys Glu He 
325 330 335 

Val Gly He Ser Val Ser Phe Lys Pro Lys Thr Ala Tyr Tyr He Pro 
340 345 350 

Leu His His Arg Asn Ala His Asn Leu Asp Glu Thr Leu Val Leu Ser 
355 360 365 
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Val Tyr Pro His Phe Asp Thr Met lie Ala Ala Tyr Leu Leu Glu Pro 
405 410 415 

Asn Glu Lys Lys Phe Asn Leu Glu Asp Leu Ser Leu Lys Phe Leu Gly 
420 425 430 

Tyr Lys Met Thr 
435 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 687 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TCGTACCNGG GNTCNCNANA TCGACTGCAG CATGCAAGCT GGCTAATCAT GGTCATAGCT 
GTTTCCTGTG TGAAATTGTT ATCCGCTCAC AATTCCACAC AACATACGAG CCGGAAGCAT 
AAAGTGTAAA GCCTGGGGTG CCTAATGAGT GAGCTAACTC ACATTAATTG CGTTGCGCTC 
ACTGCCCGCT TTCCAGTCGG GAAACCTGTC GTGCCAGCTG CATTAATGAA TCGGCCAACG 
CGCGGGGAGA GGCGGTTTGC GTATTGGGCG CTCTTCCGCT TCCTCGCTCA CTGACTCGCT 
GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT ATCAGCTCAC TCAAAGGCGG TAATACGGTT 
ATCCACAGAA TCAGGGGATA AC G CAGG AAA GAACATGTGA GCAAAAGGCC AGCAAAAGGC 
CAGGAACCGT TAAAAAGGCC GCGTTGCTGG GCGTTTTTCC ATAGGCTCCG CCCCCCTTGA 
CGAGCATCAC AAAAATTCGA CGCTTCAAGT TCAGAGGTGG GCGAAACCCG ACAGGGACTA 54 0 

TAAAGATTAC CAGGGCGTTT TCCCCCTGGG AAGCTNCCTT CGTGCGCTCT CCTGTTCCCG 



120 
180 
240 
300 
360 
420 
480 



600 



(2) INFORMATION FOR SEQ ID NO : 2 0 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 701 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE : DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GCTCGTACCG GGGATCTNNN ANATCGACTG CAGCATGCAA GCTTGGCGTA ATCATGGTCA 
TAGCTGTTTC CTGTGTGAAA TTGTTATCCG CTCACAATTC CACACAACAT ACGAGCCGGA 
AGCATAAAGT GTAAAGCCTG GGGTGCCTAA TGAGTGAGCT AACTCACATT AATTGCGTTG 
CGCTCACTGC CCGCTTTCCA GTCGGGAAAC CTGTCGTGCC AGCTGCATTA ATGAATCGGC 
CAACGCGCGG GGAGAGGCGG TTTGCGTATT GGGCGCTCTT CCGCTTCCTC GCTCACTGAC 
TCGCTGCGCT CGGTCGTTCG GCTGCGGCGA GCGGTATCAG CT C ACTC AAA GGCGGTAATA 
CGGTTATCCA CAGAATCAGG GGATAACGCA GGAAAGAACA TGTGAGCAAA AG G C C AG C AA 
AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG CTGGGCGTTT TTTCCATAGG CTCCGCCCCC 4 80 

CTGANGAGCA TCANAAAAAT CGANGCTCAN GTCANAGGTG GCGAAACCCG ACAGGNCTAT 54 0 

TAAAAGATNC CCAGGCGTTT TCCCCCCTGG GAAGCTCCCT CGTGGGGCTC TCCTGGTTNC 
GGNNCCCTGN CCGGNTTACC GGGGATAANC TTGTTCCGGN CTTTNTCCCC TTCNGGGAAA 
ANGGTGGGGG GTTTTNTNNA AAAGGCTCAA AGGCTGGTAN G 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 



120 
180 
240 
300 
360 
420 



600 
660 
701 



G NTNT AG ' INN GGNCTAANNG GCGGGGAAAT CGAGCTCGGT ACCC 
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CGACCTGCAG GCATGCAAGC TTGGCGTAAT CATGGTCATA GCTGTTTCCT GTGTGAAATT 120 

GTTATCCGCT CACAATTCCA CACAACATAC GAGCCGGAAG CATAAAGTGT AAAG CCTGGG 180 

GTGCCTAATG AG TG AG CT AA CTCACATTAA TTGCGTTGCG CTCACTGCCC GCTTTCCAGT 240 

CGGGAAACCT GTCGTGCCAG CTGCATTAAT GAATCGGCCA ACGCGCGGGG AGAGGCGGTT 3 00 

TGCGTATTGG GCGCTCTTCC GCTTCCTCGC TCACTGACTC GCTGCGCTCG GTCGTTCGGC 360 

TGCGGCGAGC GGTATCAG CT CACTCAAAGG CGGTAATACG GTTATCCACA GAAATCAGGG 420 

GATAACG C AG GGAAAGAACA TGTGAGCAAA AGGCCCAGCA AAAGGCCAGG AACCCGTAAA 480 

AAGGCCGCGT TGCCTGGCGT TTTTCCATAG GCTCCGCCCC CCTTGACGAG CAATCACAAA 54 0 

AATCGACGCT CAAAGTCAAG AGGTGGCGAA ACL'CCGACAG GGACTTATAA AG AT AC C C AG SCO 

GCCGTTTCCC CCTGGAAGCT CCCCTCCGTG CGCTTCTCCT TGGTTCCCGA CCCTGCCGCT 66 0 

TTACCNGGAT NCCTGTCCGC CCTTTTNTCC CTTTCNGGNA ACCGGGCGCT TTTTTTT 717 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
NNNNCNNNNG GCTGANAGCG ATAAATCGAG CTCGGTACCC GGGGATCCTC TAGAGTCGAC 
CTGCAGGCAT GCAAGCTTGG CGTAATCATG GTCATAGCTG TTTCCTGTGT GAAATTGTTA 
TCCGCTCACA ATTCCACACA ACATACGAGC CGGAAGCATA AAGTGTAAAG CCTGGGGTGC 
CTAATGAGTG AGCTAACTCA CATTAATTGC GTTGCGCTCA CTGCCCGCTT TCCAGTCGGG 
AAACCTGTCG TGCCAGCTGC ATTAATGAAT CGGCCAACGC GCGGGGAGAG GCGGTTTGCG 
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CGTTTGCTGG CGTTTTTCCC ATAGGCTCCG CCCCCCTTGA CGAACCATCA CAAAAATCGA 54 0 

CGCTCAATTC AGAAGTTGGC GAAAACCCGA CAGGACTAAT AAAGATACCC AGCGTTTCCC 6 00 

CCCCTGGAAA CTCCCCTCCG TTGCGCCTCT CCCTGTTCCC GAACCTTGCC CGCTTACCGG 66 0 

GAATACCTTG TCCNCCTTTT CTCCCCTTCC GGGAANCGTT NGCGCCTTTC CCC 713 
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What Is Claimed Is: 



1 . A Thermotoga DNA polymerase mutant which is modified at least 
one way selected from the group consisting of 

(a) to reduce or eliminate the 3 '-5' exonuclease activity of the 

polymerase; 

(b) to reduce or eliminate the 5 '-3 ' exonuclease activity of the 
polymerase; and 

(c) to reduce or eliminate discriminator}' behavior against a 
dideoxynucleotide. 

2. The DNA polymerase mutant of claim 1 , which is modified at 
least two ways. 

3. The DNA polymerase mutant of claim 1 , which is modified three 

ways. 

4. The DNA polymerase mutant of claim 1. which comprises a 
mutation in the O-helix of said DNA polymerase resulting in said DNA 
polymerase becoming non-discriminating against dideoxynucleotides. 

5 . The DNA polymerase of claim 4. wherein said O-helix is defined 
as RXXXKXXXFXXXYX, wherein X is any amino acid. 

6. The DNA polymerase of claim 5, wherein said mutant DNA 
polymerase is a Tne DNA polymerase. 
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8. The DNA polymerase mutant of claim 1, wherein said DNA 
polymerase is a Tne DNA polymerase having substantially reduced 3 '-5' 
exonuclease activity. 

9. The mutant Tne DNA polymerase as claimed in claim 8. wherein 
5 said mutant is a Asp 323 -* Ala 323 substitution. 

1 0. The mutant Tne DNA polymerase as claimed in claim 6. wherein 
said mutant polymerase comprises both a Phe 730 -*Tyr 730 substitution and a 
Asp J - J -*-Ala J/J substitution. 

1 1. The mutant DNA polymerase mutant of claim 1. wherein said 
10 DNA polymerase is a Tne DNA polymerase having substantially reduced 5 '-3' 

exonuclease activity. 

12. The mutant Tne DNA polymerase as claimed in claim 1 1 . wherein 
said mutant polymerase has a deletion mutation in the N-terminal 5 '-3' 
exonuclease domain. 



15 13. The mutant Tne DNA polymerase as claimed in claim 12. wherein 

said mutant polymerase is devoid of the 219 N-terminal amino acids. 

14. The DNA polymerase as claimed in claim 1, which is a mutant 
Tma polymerase. 



15. The DNA polymerase as claimed in claim 14, wherein said 
20 polymerase has a Phe 730 -*Tyr 730 substitution. 



has substantially reduced 3 '-5' exonuclease activity. 
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1 7. The mutant Tma DNA polymerase as claimed in claim 1 6. wherein 
said mutant is a Asp 323 -* Ala 323 substitution. 

1 8. The mutant Tma DNA polymerase as claimed in claim 14. wherein 
said mutant polymerase comprises both a Phe 730 ->Tyr 730 substitution and a 
Asp 323 -* Ala 323 substitution. 

19. The mutant DNA polymerase mutant of claim 14, wherein said 
DNA polymerase is a Tma DNA polymerase having substantially reduced 5'-3' 
exonuclease activity. 

20. The mutant Tma DNA polymerase as claimed in claim 1 9, wherein 
said mutant polymerase has a deletion mutation in the N-terminal 5 '-3' 
exonuclease domain. 

2 1 . The mutant Tma DNA polymerase as claimed in claim 20, wherein 
said mutant polymerase is devoid of the 219 or less N-terminal amino acids. 

22. A vector comprising a gene encoding the DNA polymerase of 
claim 1. 

23. The vector of claim 22, wherein said gene is operably linked to a 
promoter. 

24. The vector of claim 23, wherein said promoter is selected from the 
group consisting of a A.-P L promoter, a tac promoter, a trp promoter, and a trc 
promoter. 
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26. A method of producing a DNA polymerase, said method 
comprising: 

(a) culturing the host cell of claim 25; 

(b) expressing said gene; and 

(c) isolating said DNA polymerase from said host cell. 

27. The method of claim 26, wherein said host cell is E. coli. 



28. A method of synthesizing a double-stranded DNA molecule 
comprising: 

(a) hybridizing a primer to a first DNA molecule; and 
10 (b) incubating said DNA molecule of step (a) in the presence 

of one or more deoxy- or dideoxyribonucleoside triphosphates and the DNA 
polymerase of claim 1, under conditions sufficient to synthesize a second DNA 
molecule complementary to all or a portion of said first DNA molecule. 

29. The method of claim 28. wherein said deoxy- or 
15 dideoxyribonucleoside triphosphates are selected from the group consisting of 

dATP, dCTP. dGTP. dTTP. dITP. 7-deaza-dGTP, 7-deaza-dATP, dUTP. ddATP, 
ddCTP, ddGTP. ddlTP, ddTTP, [a-S]dATP, [a-S]dTTP, [a-S]dGTP, and 
[a-S]dCTP. 



30. The method of claim 28, wherein one or more of said 
20 deoxyribonucleoside triphosphates are detectably labeled. 



31. A method of sequencing a DNA molecule, comprising: 

' (-^ '^.-bridi7inn n. primer to n fir^t DNA molecule' 



.:coxvnbonucieoskiv' triphosphates, the DNA polymerase 
25 terminator nucleotide; 
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(c) incubating the mixture of step (b) under conditions 
sufficient to synthesize a random population of DNA molecules complementary 
to said first DNA molecule, wherein said synthesized DNA molecules are shorter 
in length than said first DNA molecule and wherein said synthesized DNA 

5 molecules comprise a terminator nucleotide at their 3 ' termini; and 

(d) separating said synthesized DNA molecules by size so that 
at least a part of the nucleotide sequence of said first DNA molecule can be 
determined. 

32. The method of claim 31, wherein said deoxyribonucleoside 
10 triphosphates are selected from the group consisting of dATP, dCTP, dGTP, 

dTTP, dITP, 7-deaza-dGTP, 7-deaza-dATP, dUTP, [a-S]dATP, [a-S]dTTP, 
[a-S]dGTP, and [a-S]dCTP. 

33. The method of claim 31, wherein said terminator nucleotide is 
ddTTP, ddATP, ddGTP or ddCTP. 

15 34. The method of claim 31, wherein one or more of said 

deoxyribonucleoside triphosphates is detectably labeled. 

35. The method of claim 3 1 , wherein one or more of said terminator 
nucleotides is detectably labeled. 

36. The method of claim 31, wherein said primer is detectably labeled. 



20 37. A method for amplifying a double stranded DNA molecule, 

comprising: 
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of said DNA molecule and said second primer is complementary to a sequence 
at or near the 3 '-termini of the second strand of said DNA molecule; 

(b) hybridizing said first primer to said first strand and said 
second primer to said second strand in the presence of the DNA polymerase of 
claim 1 , under conditions such that a third DNA molecule complementary to said 
first strand and a fourth DNA molecule complementary to said second strand are 
synthesized; 

(c) denaturing said first and third strand, and said second and 
fourth strands; and 

(d) repeating steps (a) to (c) one or more times. 

38. The method of claim 37, wherein said deoxyribonucleoside 
triphosphates are selected from the group consisting of dATP, dCTP, dGTP, 
dTTP, dITP, 7-deaza-dGTP, 7-deaza-dATP, dUTP, [a-S]dATP, [a-S]dTTP, 
[a-S]dGTP, and [a-S]dCTP. 

39. A kit for sequencing a DNA molecule, comprising: 

(a) a first container means comprising the DNA polymerase 

of claim 1; 

(b) a second container means comprising one or more 
dideoxyribonucleoside triphosphates; and 

(c) a third container means comprising one or more 
deoxyribonucleoside triphosphates. 

40. A kit for amplifying a DNA molecule, comprising: 

(a) a first container means comprising the DNA polymerase 

of claim 1 ; and 



■ox\ 1 1 r hi nuci cumuli; inpnohpnate: 
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41. A mutant DNA polymerase having substantially reduced or 
eliminated 5 '-3' exonuclease activity, wherein at least one of the amino acids 
corresponding to Asp 8 , Glu" 2 , Asp 114 , Asp 115 , Asp 137 , Asp 139 , Gly 102 , Gly 187 , or 
Gly 195 of Tne DNA polymerase has been mutated. 

5 42. The mutant DNA polymerase of claim 41 , which is a mutant E. 

coli poll, Tag, Tne, or Tma DNA polymerase. 



43. The mutant DNA polymerase of claim 42, which is a Tne DNA 
polymerase. 

44. The mutant DNA polymerase of claim 43, which is a Tma DNA 
10 polymerase. 

45 . A vector coding for the mutant DNA polymerase of claim 4 1 . 



46. A host cell comprising the vector of claim 45. 



47. A method of producing a mutant DNA polymerase having 
substantially reduced or eliminated 5'-3' exonuclease activity, wherein at least 
15 one of the amino acids corresponding to Asp 8 , Glu 112 , Asp" 4 , Asp 115 , Asp 137 , 

Asp 139 , Gly' 02 , Gly 187 , or Gly 195 of Tne DNA polymerase has been mutated, 
comprising 

(a) culturing the host cell of claim 46; 

(b) expressing the mutant DNA polymerase; and 
20 (c) isolating said mutant DNA polymerase. 



complementary primer to form a hybrid, and 
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(b) contacting said hybrid formed in step (a) with the Tne 
DNA polymerase of claim 1 and dATP, dCTP, dGTP and dTTP, 
whereby a cDNA-RNA hybrid is obtained. 

49. A method of preparing dsDNA from mRNA, comprising 
5 (a) contacting mRNA with an oligo(dT) primer or other 

complementary primer to form a hybrid, and 

(b) contacting said hybrid formed in step (a) with the Tne 
DNA polymerase claim 1. dATP, dCTP, dGTP and dTTP, and an oligonucleotide 
which is complementary to the first strand cDNA; 
10 whereby dsDNA is obtained. 
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BomHI 

1 GGATCCAGAC TGGTGGATCG TCAGTGCGGA TTATTCCCAA ATAGAACTCA GAATCCTCGC 
GSR LVD RQCG LFP NRT QNPR 

— DPO W W I VSA DYSQ I E L RIL 

IQ TGGS SVR IIP K-NS ESS 

61 TCATCTCAGT GGTGATGAGA ACCTTGTGAA GGCCTTCGAG GAGGGCATCG ATGTGCACAC 
SSQ W - - EPCE GLR GGH RCAH 

— A HLS GDE NLV KAFE EGI DVH 

LIS VVMR TL - RPS RRASMCT 

121 CTTGACTGCC TCCAGGATCT ACAACGTAAA GCCAGAAGAA GTGAACGAAG AAATGCGACG 
LDC LOD LQRK ARR SER RNAT 
— m- T LTA SRI Y N V KPEE VNE EMR 
P-l_ PPGS TT - SQKK - TKKCD 

181 GGTTGGAAAG ATGGTGAACT TCTCTATAAT ATACGGTGTC ACACCGTACG GTCTTTCTGT 

G W K D G E L L Y N IRC H T V R S F C 

_^R V G K M V N ©S I I Y G V T P Y G L S 

G L E R W - T S L - Y T V S H R T V F L 

241 GAGACTTGGA ATACCGGTTA AAGAAGCAGA AAAGATGATT ATCAGCTATT TCACACTGTA 
ETW NTG - RSR KDD YQL FHTV 
■— ~V RLG IPV KEA EKMI I S Y FTL 
-DL EYRL KKQ K R - LSAI SHC 

301 TCCAAAGGTG CGAAGCTACA TCCAGCAGGT TGTTGCAGAG GCAAAAGAGA AGGGCTACGT 

S K G A K L H P A G C C R G K R E G L R 

— PKV RSY IQQ VVAE AKE KGY 

IQR CEAT SSR LLQ RQKR RAT 

361 CAGGACTCTC TTTGGAAGAA AAAGAGATAT TCCCCAGCTC ATGGCAAGGG ACAAGAACAC 

QDS LWK KKRY SPA HGK GQEH 

_^V RTL FGR KRD IPQL MAR DKN 

S G L S L E E K E I F P S S W Q G T R T 

421 CCAGTCCGAA GGCGAAAGAA TCGCAATAAA CACCCCCATT CAGGGAACTG CGGCAGATAT 
PVR RRK NRNK HPH SGN CGRY 

— T QSE GER IAI NTPI QGT AAD 

P. S P K A K E S Q - T P P F R E L R Q I 



FIG.5A 
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481 AATAAAATTG GCTATGATAG ATATAGACGA GGAGCTGAGA AAAAGAAACA TGAAATCCAG 
N K 1 G Y D R Y R R G A E K K K H E I Q 

[ K L AMI DID E E L R K R N M K S 
__ N w L - - I-T RS - EKET - NP 

541 AATGATCATT CAGGTTCATG ACGAACTGGT CTTCGAGGTT CCCGATGAGG AAAAAGAAGA 
N D H S G S - R T G L R G S R - G K R R 
— ^R Mil Q V H DEL V F E V P D E EKE 
E -,S F R F M T N W S S R F P M R K K K 

601 ACTAGTTGAT CTGGTGAAGA ACAAAATGAC AAATGTGGTG AAACTCTCTG TGCCTCTTGA 
TS - SGE EQND KCG ETL C A S - 
r ■ w n l v K N K M T N V V K L S V P I 
L N L - \ U I 'w V - ' R "t K - QMW - N S L C L L 

661 GGTTGACATA AGCATCGGAA AAAGCTGGTC TTGA 

G - H K H R K K L V L 
— ^E V D 1 S I G K S W S - 
R L T - A S E K A G L 



FIG.5B 
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